Fusion proteins comprising enzyme replacement therapy enzymes

ABSTRACT

Provided herein are fusion proteins that comprise an enzyme replacement therapy enzyme and an Fc region, as well as methods of using such proteins to treat a lysosomal storage disorder. Methods for transporting agents across the blood-brain barrier are also provided herein.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No. PCT/US2018/053747, filed Oct. 1, 2018, which claims priority to U.S. Provisional Application No. 62/566,898, filed on Oct. 2, 2017, U.S. Provisional Application No. 62/583,276, filed on Nov. 8, 2017, U.S. Provisional Application No. 62/626,365, filed on Feb. 5, 2018, U.S. Provisional Patent No. 62/678,183, filed on May 30, 2018, and U.S. Provisional Application No. 62/721,396, filed on Aug. 22, 2018, the disclosures of which are incorporated herein by reference in their entirety for all purposes.

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Sep. 28, 2018, is named 102342-000350PC-1103949_SL.txt and is 580,464 bytes in size.

BACKGROUND

Lysosomal storage disorders (LSDs) are relatively rare inherited metabolic diseases that result from defects in lysosomal function. LSDs are typically caused by the deficiency of a single enzyme that participates in the breakdown of metabolic products in the lysosome. The buildup of the product resulting from lack of the enzymatic activity affects various organ systems and can lead to severe symptoms and premature death. The majority of LSDs also have a significant neurological component, which ranges from progressive neurodegeneration and severe cognitive impairment to epileptic, behavioral, and psychiatric disorders. A recombinant form of an enzyme that is deficient in an LSD can be used to treat the disorder, but such therapies may have little effect on the brain due to difficulties in delivering the recombinant enzyme across the blood-brain barrier (BBB).

SUMMARY

Provided herein are fusion proteins comprising enzyme replacement therapy (ERT) enzymes and methods of use thereof for treating lysosomal storage disorders (LSDs).

In some aspects, provided herein is a protein comprising:

-   -   (a) a first Fc polypeptide that is linked to an ERT enzyme, an         ERT enzyme variant, or a catalytically active fragment thereof;         and     -   (b) a second Fc polypeptide that forms an Fc dimer with the         first Fc polypeptide.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide does not include an immunoglobulin heavy and/or light chain variable region sequence or an antigen-binding portion thereof.

In some embodiments, the ERT enzyme is iduronate 2-sulfatase (IDS), an IDS variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234.

In some embodiments, the ERT enzyme is N-sulfoglucosamine sulfohydrolase (SGSH), an SGSH variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:119 and 120. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:119 and 120.

In some embodiments, the ERT enzyme is acid sphingomyelinase (ASM), an ASM variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123.

In some embodiments, the ERT enzyme is β-glucocerebrosidase (GBA), a GBA variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:93 and 94. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:93 and 94.

In some embodiments, the first Fc polypeptide is a fusion polypeptide that is linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a peptide bond or by a polypeptide linker. In some embodiments, the polypeptide linker is a flexible polypeptide linker. In some embodiments, the flexible polypeptide linker is a glycine-rich linker. In some embodiments, the glycine-rich linker is G₄S (SEQ ID NO:239) or (G₄S)₂ (SEQ ID NO:240). In some embodiments, the first Fc polypeptide is not linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a chemical cross-linking agent, e.g., the fusion polypeptide does not include a non-peptide bond or a non-polypeptide linker.

In certain embodiments, the fusion polypeptide comprises from N- to C-terminus: the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof; the polypeptide linker; and the first Fc polypeptide.

In some embodiments, the second Fc polypeptide is linked to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof. In some embodiments, the second Fc polypeptide is a fusion polypeptide that is linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a peptide bond or by a polypeptide linker. In some embodiments, the polypeptide linker is a flexible polypeptide linker. In some embodiments, the flexible polypeptide linker is a glycine-rich linker. In some embodiments, the glycine-rich linker is G₄S (SEQ ID NO:239) or (G₄S)₂ (SEQ ID NO:240). In some embodiments, the second Fc polypeptide is not linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a chemical cross-linking agent, e.g., the fusion polypeptide does not include a non-peptide bond or a non-polypeptide linker.

In some embodiments, the N-terminus of the first Fc polypeptide and/or the N-terminus of the second Fc polypeptide is linked to the ERT enzyme. In some embodiments, the N-terminus of the first Fc polypeptide is linked to one ERT enzyme and the N-terminus of the second Fc polypeptide is linked to the other ERT enzyme.

In some embodiments, the C-terminus of the first Fc polypeptide and/or the C-terminus of the second Fc polypeptide is linked to the ERT enzyme. In some embodiments, the C-terminus of the first Fc polypeptide is linked to one ERT enzyme and the C-terminus of the second Fc polypeptide is linked to the other ERT enzyme.

In some embodiments, the N-terminus of the first Fc polypeptide is linked to one ERT enzyme and the C-terminus of the second Fc polypeptide is linked to the other ERT enzyme. In some embodiments, the C-terminus of the first Fc polypeptide is linked to one ERT enzyme and the N-terminus of the second Fc polypeptide is linked to the other ERT enzyme.

In some embodiments, the protein comprises a single ERT enzyme, and the N-terminus or the C-terminus of the first Fc polypeptide is linked to the ERT enzyme. In some embodiments, the protein comprises two ERT enzymes (e.g., exactly two ERT enzymes). In some embodiments, the protein comprises exactly one or exactly two ERT enzymes, enzyme variants, or catalytically active fragments thereof.

In some embodiments, the first Fc polypeptide is a modified Fc polypeptide and/or the second Fc polypeptide is a modified Fc polypeptide.

In some embodiments, the first Fc polypeptide and the second Fc polypeptide each contain modifications that promote heterodimerization. In some embodiments, the Fc dimer is an Fc heterodimer. In some embodiments, one of the Fc polypeptides has a T366W substitution and the other Fc polypeptide has T366S, L368A, and Y407V substitutions, according to EU numbering. In some embodiments, the first Fc polypeptide contains the T366S, L368A, and Y407V substitutions and the second Fc polypeptide contains the T366W substitution. In some embodiments, the first Fc polypeptide is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:117, 232, and 236. In some embodiments, the first Fc polypeptide contains the T366W substitution and the second Fc polypeptide contains the T366S, L368A, and Y407V substitutions. In some embodiments, the first Fc polypeptide is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:118, 233, and 237.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises a native FcRn binding site. In some embodiments, the first Fc polypeptide and the second Fc polypeptide do not have effector function. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide includes a modification that reduces effector function. In some embodiments, the modification that reduces effector function is the substitutions of Ala at position 234 and Ala at position 235, according to EU numbering. In some embodiments, the modification that reduces effector function further comprises a substitution of Gly at position 329, according to EU numbering. In some embodiments, the first Fc polypeptide is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:115, 231, and 235. In some embodiments, the first Fc polypeptide is linked to the ERT enzyme SGSH and comprises the amino acid sequence of any one of SEQ ID NOS:149, 150, 152, and 153.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises amino acid changes relative to the native Fc sequence that extend serum half-life. In some embodiments, the amino acid changes comprise substitutions of Tyr at position 252, Thr at position 254, and Glu at position 256, according to EU numbering. Alternatively, in other embodiments, the amino acid changes comprise substitutions of Leu at position 428 and Ser at position 434, according to EU numbering. Alternatively, in further embodiments, the amino acid changes comprise a substitution of Ser or Ala at position 434, according to EU numbering.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide specifically binds to a transferrin receptor (TfR).

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises at least two substitutions at positions selected from the group consisting of 384, 386, 387, 388, 389, 390, 413, 416, and 421, according to EU numbering. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises at least three, four, five, six, seven, eight, or nine substitutions at the positions.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide further comprises one, two, three, or four substitutions at positions comprising 380, 391, 392, and 415, according to EU numbering. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide further comprises one, two, or three substitutions at positions comprising 414, 424, and 426, according to EU numbering.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises Trp at position 388. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises an aromatic amino acid at position 421. In some embodiments, the aromatic amino acid at position 421 is Trp or Phe.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises at least one position selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or 11 positions selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises 11 positions as follows: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide has a CH3 domain with at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 111-217 of any one of SEQ ID NOS:34-38, 58, and 60-90, 151, and 156-229. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:156-229. In some embodiments, the residues at at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 of the positions corresponding to EU index positions 380, 384, 386, 387, 388, 389, 390, 391, 392, 413, 414, 415, 416, 421, 424 and 426 of any one of SEQ ID NOS:34-38, 58, and 60-90, 151, and 156-229 are not deleted or substituted.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:157. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:169. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:181. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:193. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:205. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide comprises the amino acid sequence of SEQ ID NO:217.

In some embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:115, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228). In other embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:115, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229).

In some embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:231, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228). In other embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:231, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229).

In some embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:235, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228). In other embodiments, the first Fc polypeptide comprises the amino acid sequence of SEQ ID NO:235, and the second Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229).

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide binds to the apical domain of TfR. In some embodiments, the binding of the protein to TfR does not substantially inhibit binding of transferrin to TfR.

In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide has an amino acid sequence identity of at least 75%, or at least 80%, 85%, 90%, 92%, or 95%, as compared to the corresponding wild-type Fc polypeptide. In some embodiments, the corresponding wild-type Fc polypeptide is a human IgG1, IgG2, IgG3, or IgG4 Fc polypeptide.

In some embodiments, uptake of the ERT enzyme into the brain (e.g., using an appropriate animal model such as those described herein) is greater than the uptake of the ERT enzyme in the absence of the first Fc polypeptide and/or the second Fc polypeptide or the uptake of the ERT enzyme without the modifications to the first Fc polypeptide and/or the second Fc polypeptide that result in TfR binding. In some embodiments, uptake of the ERT enzyme into the brain is at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100-fold greater as compared to the uptake of the ERT enzyme in the absence of the first Fc polypeptide and/or the second Fc polypeptide or as compared to the uptake of the ERT enzyme without the modifications to the first Fc polypeptide and/or the second Fc polypeptide that result in TfR binding.

In some embodiments, the first Fc polypeptide is not modified to bind to a blood-brain barrier (BBB) receptor and the second Fc polypeptide is modified to specifically bind to TfR. In some embodiments, the first Fc polypeptide is modified to specifically bind to TfR and the second Fc polypeptide is not modified to bind to a BBB receptor.

In some embodiments, the protein does not include an immunoglobulin heavy and/or light chain variable region sequence or an antigen-binding portion thereof.

In some aspects, provided herein is a polypeptide comprising an Fc polypeptide that is linked to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof, wherein the Fc polypeptide contains one or more modifications that promote its heterodimerization to another Fc polypeptide.

In some embodiments, the ERT enzyme is IDS, an IDS variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234.

In some embodiments, the ERT enzyme is SGSH, an SGSH variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:119 and 120. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:119 and 120.

In some embodiments, the ERT enzyme is ASM, an ASM variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123.

In some embodiments, the ERT enzyme is GBA, a GBA variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:93 and 94. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:93 and 94.

In some embodiments, the Fc polypeptide is a fusion polypeptide that is linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a peptide bond or by a polypeptide linker. In some embodiments, the polypeptide linker is a flexible polypeptide linker. In some embodiments, the flexible polypeptide linker is a glycine-rich linker. In some embodiments, the glycine-rich linker is G₄S (SEQ ID NO:239) or (G₄S)₂ (SEQ ID NO:240). In some embodiments, the Fc polypeptide is not linked to the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof by a chemical cross-linking agent, e.g., the fusion polypeptide does not include a non-peptide bond or a non-polypeptide linker.

In certain embodiments, the fusion polypeptide comprises from N- to C-terminus: the ERT enzyme, the ERT enzyme variant, or the catalytically active fragment thereof; the polypeptide linker; and the first Fc polypeptide.

In some embodiments, the Fc polypeptide contains T366S, L368A, and Y407V substitutions, according to EU numbering. In some embodiments, the polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:115, 117, 231, 232, 235, and 236. In some embodiments, the polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:149 and 150. In some embodiments, the Fc polypeptide contains a T366W substitution. In some embodiments, the polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:118, 233, and 237. In some embodiments, the polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:152-155. In some embodiments, the polypeptide further comprises the other Fc polypeptide. In some embodiments, the other Fc polypeptide contains a T366W substitution or contains T366S, L368A, and Y407V substitutions and forms an Fc dimer with the ERT enzyme-Fc fusion polypeptide.

In some embodiments, the Fc polypeptide comprises a native FcRn binding site. In some embodiments, the Fc polypeptide does not have effector function. In some embodiments, the Fc polypeptide includes a modification that reduces effector function. In some embodiments, the modification that reduces effector function is the substitutions of Ala at position 234 and Ala at position 235, according to EU numbering. In some embodiments, the modification that reduces effector function further comprises a substitution of Gly at position 329, according to EU numbering.

In some embodiments, the Fc polypeptide comprises amino acid changes relative to the native Fc sequence that extend serum half-life. In some embodiments, the amino acid changes comprise substitutions of Tyr at position 252, Thr at position 254, and Glu at position 256, according to EU numbering.

In some embodiments, the Fc polypeptide specifically binds to TfR.

In some embodiments, the Fc polypeptide comprises at least two substitutions at positions selected from the group consisting of 384, 386, 387, 388, 389, 390, 413, 416, and 421, according to EU numbering. In some embodiments, the Fc polypeptide comprises at least three, four, five, six, seven, eight, or nine substitutions at the positions.

In some embodiments, the Fc polypeptide further comprises one, two, three, or four substitutions at positions comprising 380, 391, 392, and 415, according to EU numbering. In some embodiments, the Fc polypeptide further comprises one, two, or three substitutions at positions comprising 414, 424, and 426, according to EU numbering.

In some embodiments, the Fc polypeptide comprises Trp at position 388. In some embodiments, the Fc polypeptide comprises an aromatic amino acid at position 421. In some embodiments, the aromatic amino acid at position 421 is Trp or Phe.

In some embodiments, the Fc polypeptide comprises at least one position selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the Fc polypeptide comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or 11 positions selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the Fc polypeptide comprises 11 positions as follows: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the Fc polypeptide has a CH3 domain with at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 111-217 of any one of SEQ ID NOS:34-38, 58, and 60-90. In some embodiments, the residues at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 of the positions corresponding to EU index positions 380, 384, 386, 387, 388, 389, 390, 391, 392, 413, 414, 415, 416, 421, 424 and 426 of any one of SEQ ID NOS:34-38, 58, and 60-90 are not deleted or substituted.

In some embodiments, the Fc polypeptide binds to the apical domain of TfR. In some embodiments, the binding of the protein to TfR does not substantially inhibit binding of transferrin to TfR.

In some embodiments, the Fc polypeptide has an amino acid sequence identity of at least 75%, or at least 80%, 85%, 90%, 92%, or 95%, as compared to the corresponding wild-type Fc polypeptide. In some embodiments, the corresponding wild-type Fc polypeptide is a human IgG1, IgG2, IgG3, or IgG4 Fc polypeptide.

In some embodiments, the Fc polypeptide does not include an immunoglobulin heavy and/or light chain variable region sequence or an antigen-binding portion thereof.

In some embodiments, provided herein is a polynucleotide comprising a nucleic acid sequence encoding a polypeptide comprising an Fc polypeptide that is linked to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof, wherein the Fc polypeptide contains one or more modifications that promote its heterodimerization to another Fc polypeptide. In some embodiments, provided herein is a vector comprising the polynucleotide. In some embodiments, provided herein is a host cell comprising the polynucleotide or the vector. In some embodiments, the host cell further comprises a polynucleotide comprising a nucleic acid sequence encoding the other Fc polypeptide. In some embodiments, provided herein is a method for producing the polypeptide described herein, comprising culturing a host cell under conditions in which the polypeptide encoded by the polynucleotide is expressed.

In some aspects, provided herein is a protein comprising:

-   -   (a) a first polypeptide chain that comprises a modified Fc         polypeptide that specifically binds to TfR;     -   (b) a second polypeptide chain that comprises an Fc polypeptide,         wherein the first and second polypeptide chains form an Fc         dimer; and     -   (c) an ERT enzyme, an ERT enzyme variant, or a catalytically         active fragment thereof, that is linked to the modified Fc         polypeptide of (a) or to the Fc polypeptide of (b).

In some embodiments, the ERT enzyme is IDS, an IDS variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234.

In some embodiments, the ERT enzyme is SGSH, an SGSH variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:119 and 120. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:119 and 120.

In some embodiments, the ERT enzyme is ASM, an ASM variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123.

In some embodiments, the ERT enzyme is GBA, a GBA variant, or a catalytically active fragment thereof. In some embodiments, the ERT enzyme comprises an amino acid sequence having at least 80%, 85%, 90%, or 95% identity to the amino acid sequence of any one of SEQ ID NOS:93 and 94. In some embodiments, the ERT enzyme comprises the amino acid sequence of any one of SEQ ID NOS:93 and 94.

In some embodiments, the ERT enzyme is linked to the modified Fc polypeptide of (a). In some embodiments, the ERT enzyme is linked to the Fc polypeptide of (b). In some embodiments, the Fc polypeptide of (b) is not modified to bind to a BBB receptor. In some embodiments, the Fc polypeptide of (b) is a modified Fc polypeptide that specifically binds to TfR.

In some embodiments, the ERT enzyme is linked (e.g., fused) to the modified Fc polypeptide of (a) or to the Fc polypeptide of (b) by a peptide bond or by a polypeptide linker to form a fusion polypeptide. In some embodiments, the polypeptide linker is a flexible polypeptide linker. In some embodiments, the flexible polypeptide linker is a glycine-rich linker. In some embodiments, the glycine-rich linker is G₄S (SEQ ID NO:239) or (G₄S)₂ (SEQ ID NO:240). In some embodiments, the ERT enzyme is not linked to the modified Fc polypeptide of (a) or to the Fc polypeptide of (b) by a chemical cross-linking agent, e.g., the fusion polypeptide does not include a non-peptide bond or a non-polypeptide linker.

In some embodiments, the ERT enzyme is linked to the N-terminus of the modified Fc polypeptide of (a) or to the N-terminus of the Fc polypeptide of (b). In some embodiments, the ERT enzyme is linked to the C-terminus of the modified Fc polypeptide of (a) or to the C-terminus of the Fc polypeptide of (b).

In some embodiments, the protein comprises two ERT enzymes. In some embodiments, one ERT enzyme is linked to the modified Fc polypeptide of (a) and the other ERT enzyme is linked to the Fc polypeptide of (b). In some embodiments, the ERT enzymes are both linked to the N-terminus or are both linked to the C-terminus of the respective Fc polypeptides. In some embodiments, one ERT enzyme is linked to the N-terminus of the modified Fc polypeptide of (a) and the other ERT enzyme is linked to the C-terminus of the Fc polypeptide of (b). In some embodiments, one ERT enzyme is linked to the C-terminus of the modified Fc polypeptide of (a) and the other ERT enzyme is linked to the N-terminus of the Fc polypeptide of (b).

In some embodiments, the Fc polypeptides of (a) and (b) each contain modifications that promote heterodimerization. In some embodiments, one of the Fc polypeptides has a T366W substitution and the other Fc polypeptide has T366S, L368A, and Y407V substitutions, according to EU numbering. In some embodiments, the modified Fc polypeptide of (a) contains the T366W substitution and the Fc polypeptide of (b) contains the T366S, L368A, and Y407V substitutions. In some embodiments, the Fc polypeptide of (b) is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:117, 232, and 236. In some embodiments, the modified Fc polypeptide of (a) contains the T366S, L368A, and Y407V substitutions and the Fc polypeptide of (b) contains the T366W substitution. In some embodiments, the Fc polypeptide of (b) is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:118, 233, and 237.

In some embodiments, the modified Fc polypeptide of (a) and/or the Fc polypeptide of (b) comprises a native FcRn binding site. In some embodiments, the modified Fc polypeptide of (a) and the Fc polypeptide of (b) do not have effector function. In some embodiments, the modified Fc polypeptide of (a) and/or the Fc polypeptide of (b) includes a modification that reduces effector function. In some embodiments, the modification that reduces effector function is the substitutions of Ala at position 234 and Ala at position 235, according to EU numbering. In some embodiments, the modification that reduces effector function further comprises a substitution of Gly at position 329, according to EU numbering. In some embodiments, the Fc polypeptide of (b) is linked to the ERT enzyme IDS and comprises the amino acid sequence of any one of SEQ ID NOS:115, 231, and 235. In some embodiments, the Fc polypeptide of (b) is linked to the ERT enzyme SGSH and comprises the amino acid sequence of any one of SEQ ID NOS:149, 150, 152, and 153.

In some embodiments, the modified Fc polypeptide of (a) and/or the Fc polypeptide of (b) comprises amino acid changes relative to the native Fc sequence that extend serum half-life. In some embodiments, the amino acid changes comprise substitutions of Tyr at position 252, Thr at position 254, and Glu at position 256, according to EU numbering.

In some embodiments, the modified Fc polypeptide comprises at least two substitutions at positions selected from the group consisting of 384, 386, 387, 388, 389, 390, 413, 416, and 421, according to EU numbering. In some embodiments, the modified Fc polypeptide comprises at least three, four, five, six, seven, eight, or nine substitutions at the positions.

In some embodiments, the modified Fc polypeptide further comprises one, two, three, or four substitutions at positions comprising 380, 391, 392, and 415, according to EU numbering. In some embodiments, the modified Fc polypeptide further comprises one, two, or three substitutions at positions comprising 414, 424, and 426, according to EU numbering.

In some embodiments, the modified Fc polypeptide comprises Trp at position 388. In some embodiments, the modified Fc polypeptide comprises an aromatic amino acid at position 421. In some embodiments, the aromatic amino acid at position 421 is Trp or Phe.

In some embodiments, the modified Fc polypeptide comprises at least one position selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the modified Fc polypeptide comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or 11 positions selected from the following: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the modified Fc polypeptide comprises 11 positions as follows: position 380 is Trp, Leu, or Glu; position 384 is Tyr or Phe; position 386 is Thr; position 387 is Glu; position 388 is Trp; position 389 is Ser, Ala, Val, or Asn; position 390 is Ser or Asn; position 413 is Thr or Ser; position 415 is Glu or Ser; position 416 is Glu; and position 421 is Phe.

In some embodiments, the modified Fc polypeptide has a CH3 domain with at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 111-217 of any one of SEQ ID NOS:34-38, 58, and 60-90, 151, and 156-229. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:156-229. In some embodiments, the residues at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 of the positions corresponding to EU index positions 380, 384, 386, 387, 388, 389, 390, 391, 392, 413, 414, 415, 416, 421, 424 and 426 of any one of SEQ ID NOS:34-38, 58, and 60-90, 151, and 156-229 are not deleted or substituted.

In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:157. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:169. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:181. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:193. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:205. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of SEQ ID NO:217.

In some embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:115. In other embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:115.

In some embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:231. In other embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:231.

In some embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:205 and 228 (e.g., SEQ ID NO:228) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:235. In other embodiments, the first polypeptide chain comprises the amino acid sequence of any one of SEQ ID NOS:169 and 229 (e.g., SEQ ID NO:229) and the second polypeptide chain comprises the amino acid sequence of SEQ ID NO:235.

In some embodiments, the modified Fc polypeptide binds to the apical domain of TfR. In some embodiments, the binding of the protein to TfR does not substantially inhibit binding of transferrin to TfR.

In some embodiments, the modified Fc polypeptide has an amino acid sequence identity of at least 75%, or at least 80%, 85%, 90%, 92%, or 95%, as compared to the corresponding wild-type Fc polypeptide. In some embodiments, the corresponding wild-type Fc polypeptide is a human IgG1, IgG2, IgG3, or IgG4 Fc polypeptide.

In some embodiments, uptake of the ERT enzyme into the brain (e.g., using an appropriate animal model such as those described herein) is greater than the uptake of the ERT enzyme in the absence of the Fc polypeptide or the uptake of the ERT enzyme without the modifications to the Fc polypeptide that result in TfR binding. In some embodiments, uptake of the ERT enzyme into the brain is at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100-fold greater as compared to the uptake of the ERT enzyme in the absence of the Fc polypeptide or as compared to the uptake of the ERT enzyme without the modifications to the Fc polypeptide that result in TfR binding.

In some aspects, provided herein is a method of treating an LSD, the method comprising administering a protein or polypeptide as described above to a patient in need thereof. In some embodiments, the method decreases the accumulation of a toxic metabolic product in the patient, e.g., a toxic metabolic product in the patient's brain and/or cerebrospinal fluid (CSF) is decreased.

In related aspects, provided herein is a method of decreasing the accumulation of a toxic metabolic product in a patient having an LSD, the method comprising administering a protein or polypeptide as described above to the patient. In some embodiments, the method decreases the accumulation of a toxic metabolic product in the patient's brain and/or CSF.

In some embodiments, the LSD is Hunter syndrome, and the ERT enzyme is IDS. In some embodiments, the toxic metabolic product comprises heparan sulfate-derived disaccharides and/or dermatan sulfate-derived disaccharides.

In some embodiments, the LSD is Sanfilippo syndrome A, and the ERT enzyme is SGSH. In some embodiments, the toxic metabolic product comprises heparan sulfate-derived oligosaccharides (e.g., hexasaccharides).

In some embodiments, the LSD is Niemann-Pick disease, and the ERT enzyme is ASM. In some embodiments, the toxic metabolic product comprises sphingomyelin.

In some embodiments, the LSD is Gaucher's disease or Parkinson's disease, and the ERT enzyme is GBA. In some embodiments, the toxic metabolic product comprises glucosylceramide.

In some embodiments, the total amount of a toxic metabolic product is reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% as compared to the total amount of the toxic metabolic product in the absence of the protein or polypeptide. Illustrative assays for measuring ERT enzyme activity and substrate accumulation are described herein.

In some aspects, provided herein is a pharmaceutical composition comprising a protein or polypeptide as described above and a pharmaceutically acceptable carrier.

In some aspects, provided herein is a method of monitoring substrate accumulation to assess IDS activity, the method comprising:

-   -   (a) disrupting cells in a cell or tissue sample or microvesicles         in a fluid sample from a subject administered a protein or         polypeptide as described above to break open microvesicles to         obtain a glycosaminoglycan (GAG) solution to be analyzed;     -   (b) digesting the GAG solution with at least one heparinase         (e.g., heparinase I, heparinase II, and heparinase III) and         chrondroitinase B to obtain GAG-derived disaccharides;     -   (c) analyzing the GAG-derived disaccharides by mass spectrometry         (e.g., LC-MS/MS); and     -   (d) determining the levels of heparan sulfate- and/or dermatan         sulfate-derived disaccharides, wherein decreased levels of         heparan sulfate- and/or dermatan sulfate-derived disaccharides         compared to a control that lacks IDS activity is indicative of         increased IDS activity in the sample compared to the control.

In some embodiments, the step of disrupting cells or microvesicles comprises at least one freeze-thaw cycle and/or at least one sonication step. In some embodiments, the cells are from a tissue sample and the method comprises at least three, four, or five freeze-thaw cycles. In some embodiments, the subject is a mouse deficient in IDS activity. In some embodiments, the subject is a non-human primate. In some embodiments, the subject is a human patient having Hunter syndrome.

In some embodiments, the levels of heparan sulfate- and/or dermatan sulfate-derived disaccharides are reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% as compared to the levels of heparan sulfate- and/or dermatan sulfate-derived disaccharides in a control that lacks IDS activity. In some embodiments, the control is a cell or tissue sample of the same cell of tissue type obtained from the subject prior to administration of the protein or polypeptide. In some embodiments, the control is a cell or tissue sample of the same cell of tissue type known to be deficient in IDS activity. In some embodiments, the protein or polypeptide increases IDS activity in the sample by at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100-fold as compared to the IDS activity in the control.

In other aspects, provided herein is a method of monitoring substrate accumulation to assess SGSH activity, the method comprising:

-   -   (a) disrupting cells in a cell or tissue sample or microvesicles         in a fluid sample from a subject administered a protein or         polypeptide as described above to break open microvesicles to         obtain a glycosaminoglycan (GAG) solution to be analyzed;     -   (b) digesting the GAG solution with at least one heparinase to         obtain GAG-derived disaccharides;     -   (c) analyzing the GAG-derived disaccharides by mass spectrometry         (e.g., LC-MS/MS); and     -   (d) determining the levels of heparan sulfate-derived         disaccharides, wherein decreased levels of heparan         sulfate-derived disaccharides compared to a control that lacks         SGSH activity is indicative of increased SGSH activity in the         sample compared to the control.

In some embodiments, the step of disrupting cells or microvesicles comprises at least one freeze-thaw cycle and/or at least one sonication step. In some embodiments, the cells are from a tissue sample and the method comprises at least three, four, or five freeze-thaw cycles. In some embodiments, the subject is a mouse deficient in SGSH activity. In some embodiments, the subject is a non-human primate. In some embodiments, the subject is a human patient having Sanfilippo syndrome A.

In some embodiments, the levels of heparan sulfate-derived disaccharides are reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% as compared to the levels of heparan sulfate-derived disaccharides in a control that lacks SGSH activity. In some embodiments, the control is a cell or tissue sample of the same cell of tissue type obtained from the subject prior to administration of the protein or polypeptide. In some embodiments, the control is a cell or tissue sample of the same cell of tissue type known to be deficient in SGSH activity. In some embodiments, the protein or polypeptide increases SGSH activity in the sample by at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100-fold as compared to the SGSH activity in the control.

In other aspects, provided herein is a method for transporting an agent across the BBB of a mammal, the method comprising exposing the BBB to a protein that binds to a TfR with an affinity of from about 50 nM to about 250 nM, wherein the protein is linked to the agent and transports the linked agent across the BBB. In some embodiments, the maximum concentration (C_(max)) of the agent in the brain of the mammal is improved. In some embodiments, the agent is useful for treating an LSD.

In still other aspects, provided herein is a method for treating an LSD, the method comprising administering to a mammal a protein that binds to a TfR with an affinity of from about 50 nM to about 250 nM, wherein the protein is linked to an agent for treating the LSD, thereby exposing the brain of the mammal to the agent. In some embodiments, the protein improves the C_(max) of the agent in the brain as compared to the agent linked to a reference protein that binds to the TfR with a weaker affinity. In some embodiments, the reference protein binds to the TfR with an affinity of about 600 nM, or weaker.

In some embodiments, the TfR is a primate TfR. In some embodiments, the primate TfR is a human TfR. In some embodiments, the protein binds to the TfR apical domain.

In some embodiments, the protein binds to the TfR with an affinity of from about 100 nM to about 200 nM. In some embodiments, the protein binds to the TfR with an affinity of from about 110 nM to about 150 nM.

In some embodiments, the therapeutically effective concentration of the agent is a concentration that treats one or more symptoms of an LSD in the mammal. In some embodiments, the agent is a protein replacement therapeutic. In some embodiments, the agent or protein replacement therapeutic is an enzyme.

In some embodiments, the enzyme decreases the accumulation of a toxic metabolic product in the brain of the mammal having the LSD to a greater extent when linked to the protein as compared to when the enzyme is linked to the reference protein. In some embodiments, the enzyme is IDS and the LSD is Hunter syndrome. In some embodiments, the toxic metabolic product comprises heparin sulfate-derived disaccharides and/or dermatan sulfate-derived disaccharides. In some embodiments, the enzyme is SGSH and the LSD is Sanfilippo syndrome A. In some embodiments, the enzyme is ASM and the LSD is Niemann-Pick disease. In some embodiments, the enzyme is GBA and the LSD is Gaucher's disease.

In some embodiments, the agent comprises an antibody variable region. In some embodiments, the agent comprises an antibody fragment. In some embodiments, the agent comprises a Fab or scFv.

In some embodiments, the protein is a modified Fc polypeptide that contains a non-native binding site capable of binding TfR. In some embodiments, the protein comprises an antibody variable region that specifically binds TfR. In some embodiments, the protein comprises an antibody fragment. In some embodiments, the protein comprises a Fab or scFv.

In some embodiments, the protein linked to the agent is administered as part of a pharmaceutically acceptable carrier.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the purification and analysis of an IDS-Fc fusion protein comprising an Fc polypeptide linked to an iduronate-2-sulfatase (IDS) enzyme and a modified Fc polypeptide that binds to transferrin receptor (TfR).

FIG. 2 shows the results of a binding affinity assay for IDS-Fc fusion proteins analyzed in FIG. 1 demonstrating that the fusion proteins bind to TfR.

FIG. 3 provides data demonstrating in vitro IDS activity of IDS-Fc fusion proteins analyzed in FIG. 1.

FIG. 4 provides data demonstrating that knockout cells that lack IDS have increased levels of heparan sulfate-derived disaccharides as assessed using an LC-MS/MS assay, and expression of IDS in the IDS knockout (KO) cells rescues the knockout phenotype.

FIG. 5A shows that IDS-Fc fusion proteins as either an N-terminal monozyme or a C-terminal monozyme comprising the same TfR-binding Fc polypeptide (i.e., CH3C.35.21.17) reverse heparan sulfate and dermatan sulfate accumulation in IDS KO cells. FIG. 5B shows that the N-terminal monozyme (“ETV:IDS 35.21.17”) has comparable cellular efficacy to IDS. FIG. 5C shows dose-dependent reduction of accumulated S³⁵-sulfate labeled proteins in MPS II patient fibroblasts treated with IDS-Fc fusion protein (“ETV:IDS”) or IDS; n=8. FIG. 5D shows assessment of M6PR-dependent clearance of S³⁵-labeled proteins in MPS II patient fibroblasts treated with increasing doses of IDS-Fc fusion protein (“ETV:IDS”) or IDS in the presence or absence of 5 mM M6P; n=3. FIGS. 5C-5D: “ETV:IDS”=ETV:IDS 35.23.2; Graphs display mean values across experimental replicates ±SEM.

FIG. 6 provides data illustrating heparan and dermatan sulfate levels in serum from wild-type (WT) mice dosed with vehicle or IDS KO mice dosed with IDS or an IDS-Fc fusion protein (“ETV:IDS”) in mice over time. “ETV:IDS”=ETV:IDS 35.21.

FIG. 7 provides data illustrating that the collective levels of disaccharides D0SO, DOA0, and D0a4 (referred to as “Total sGAG levels”) in peripheral tissues from IDS KO mice were assessed seven days post-administration of a single intravenous injection of 40 mg/kg IDS-Fc fusion protein (“ETV:IDS”) or 5.3 mg/kg IDS and compared to vehicle-treated IDS KO and wild-type mice; n=8 for IDS KO groups and n=3 for wild-type groups. Data shown as mean±SEM with p values: one-way ANOVA with Dunnett multiple comparison test; ** p<0.01 and **** p<0.0001. “ETV:IDS”=ETV:IDS 35.21.

FIG. 8 provides data illustrating the concentration of IDS-Fc fusion proteins in the brains of human TfR knock-in (TfR^(ms/hu) KI) mice after peripheral administration of the IDS-Fc fusion protein ETV:IDS 35.21 or a control IDS-Fc fusion protein lacking the mutations that confer TfR binding (“IDS:Fc”).

FIG. 9A provides data illustrating the concentration of IDS-Fc fusion proteins in the brains of TfR^(ms/hu) KI mice after peripheral administration of the IDS-Fc fusion protein ETV:IDS 35.21.17.2 or ETV:IDS 35.23.2, or a control IDS-Fc fusion protein lacking the mutations that confer TfR binding (“IDS:Fc”). FIG. 9B provides data illustrating the liver concentration of the IDS-Fc fusion protein ETV:IDS 35.21 or IDS:Fc in TfR^(ms/hu) KI mice following a single intravenous injection of 50 mg/kg dose; n=4-5. Graphs display mean±SEM.

FIGS. 10A-10C show that ETV:IDS reduces GAGs in the brain and peripheral tissues of IDS KO×TfR^(ms/hu) KI mice. IDS KO×TfR^(ms/hu) KI mice were administered a single intravenous injection of 40 mg/kg ETV:IDS or 14.2 mg/kg IDS or four weekly doses as described in Example 2. The concentration of IDS in serum (FIG. 10A) and tissues (FIG. 10B) was measured in IDS KO×TfR^(ms/hu) KI mice following a single dose. Tissue PK is shown 2 h post-dose; n=4. Graphs display mean±SEM and p values: unpaired t-test analysis. FIG. 10C shows levels of disaccharides D0SO, DOA0, and D0a4 (“Total sGAG levels”) in brain, CSF, and peripheral tissues of IDS KO×TfR^(ms/hu) KI mice were measured after a single dose or multiple doses of ETV:IDS or IDS and compared to vehicle treatment and wild-type mice; n=8 per IDS KO×TfR^(ms/hu) KI groups and n=5 for wild-type group. Graphs display mean±SEM and p values: one-way ANOVA with Dunnett multiple comparison test; ** p<0.01, *** p<0.001, and **** p<0.0001.

FIG. 11 shows the purification and analysis of an ASM-Fc fusion protein comprising an Fc region linked to two acid sphingomyelinase (ASM) enzymes.

FIG. 12 provides data demonstrating in vitro ASM activity of an ASM-Fc fusion protein analyzed in FIG. 11.

FIG. 13 provides data showing that an ASM-Fc fusion protein as analyzed in FIG. 11 reduces sphingomyelin accumulation in ASMKO cells using an imaging-based assay.

FIG. 14 provides data showing that an ASM-Fc fusion protein as analyzed in FIG. 11 reduces sphingomyelin accumulation in ASMKO cells using an LC-MS/MS-based assay.

FIG. 15 provides data demonstrating in vitro N-sulfoglucosamine sulfohydrolase (SGSH) activity of SGSH-Fc fusion proteins described in Example 6.

FIG. 16 provides data demonstrating that knockout (KO) cells that lack SGSH have increased levels of heparan sulfate-derived disaccharides as assessed using an LC-MS/MS assay. n=3-4 independent cell lines; data shown as mean±s.e.m.

FIG. 17 provides data illustrating that a SGSH-Fc fusion protein analyzed in FIG. 15 reverses heparan sulfate accumulation in SGSH KO cells.

FIG. 18 shows the relationship between engineered TfR-binding polypeptide hTfR affinity and brain exposure over time in TfR^(ms/hu) KI mice. Dots represent cumulative brain exposure over time (AUC) of different engineered TfR-binding polypeptide affinity variants following a single dose of 50 mg/kg in TfR^(ms/hu) KI mice. Brain concentrations of polypeptide (as measured by huIgG1) were calculated at various days post-dose (ranges from 1-10 days). Data represents summary of three independent studies, n=4-5 mice per group for each study.

FIG. 19 shows the relationship between engineered TfR-binding polypeptide hTfR affinity and maximum brain concentration in TfR^(ms/hu) KI mice. Dots represent maximum brain concentrations of different polypeptide affinity variants measured at 1 day post-dose after a single 50 mg/kg dose. Data represents summary of three independent studies, n=4-5 mice per group for each study.

FIG. 20 shows the relationship between engineered TfR-binding polypeptide hTfR affinity and ratio of brain versus plasma concentration of polypeptide in TfR^(ms/hu) KI mice. Dots represent ratio of maximum brain versus plasma concentration of different polypeptide affinity variants measured at 1 day post-dose after a single 50 mg/kg dose. Data represents summary of three independent studies, n=4-5 mice per group for each study.

FIGS. 21A and 21B depict huIgG1 concentrations in plasma (FIG. 21A) and brain lysates (FIG. 21B) of TfR^(ms/hu) knock-in (KI) mice after a single 50 mg/kg systemic injection of anti-BACE1_Ab153, CH3C35.21:Ab153, CH3C35.20:Ab153, or CH3C35:Ab153 polypeptide fusion (mean±SEM, n=5 per group).

FIG. 21C depicts endogenous mouse Aβ concentration in brain lysate of TfR^(ms/hu) KI mice after a single 50 mg/kg systemic injection of anti-BACE1 Ab153, CH3C35.21:Ab153, CH3C35.20:Ab153, or CH3C35:Ab153 polypeptide fusion (mean±SEM, n=5 per group).

FIG. 21D depicts Western blot quantification of brain TfR protein normalized to actin in brain lysate of TfR^(ms/hu) KI mice after a single 50 mg/kg systemic injection of anti-BACE1_Ab153, CH3C35.21:Ab153, CH3C35.20:Ab153, or CH3C35:Ab153 polypeptide fusion (mean±SEM, n=5 per group).

DETAILED DESCRIPTION I. Introduction

We have developed fusion proteins that include an enzyme replacement therapy (ERT) enzyme linked to an Fc polypeptide. These proteins can be used to treat lysosomal storage disorders (LSDs). In some cases, the protein includes a dimeric Fc polypeptide, where one of the Fc polypeptide monomers is linked to the ERT enzyme. The Fc polypeptides can increase enzyme half-life and, in some cases, can be modified to confer additional functional properties onto the protein. Also described herein are fusion proteins that facilitate delivery of an ERT enzyme across the blood-brain barrier (BBB). These proteins comprise an Fc polypeptide and a modified Fc polypeptide that form a dimer, and an ERT enzyme linked to the Fc region and/or the modified Fc region. The modified Fc region can specifically bind to a BBB receptor such as a transferrin receptor (TfR). In some embodiments, the ERT enzyme is iduronate 2-sulfatase (IDS), or a catalytically active variant or fragment of a wild-type IDS, e.g., a wild-type human IDS. In other embodiments, the ERT enzyme is N-sulfoglucosamine sulfohydrolase (SGSH), acid sphingomyelinase (ASM), β-glucocerbrosidase (GBA), or a catalytically active variant or fragment of a wild-type SGSH, ASM, or GBA, e.g., a wild-type human SGSH, ASM, or GBA.

We have also developed a method for transporting therapeutic agents that are linked to TfR-binding polypeptides and proteins across the BBB for the treatment of disease. We discovered that the desired TfR binding affinity for transporting a therapeutic agent across the BBB depends on the target of the therapeutic agent, as well as the mechanism of action that drives efficacy in treating the disease. In particular, we discovered that using polypeptides and proteins that have stronger TfR affinities results in greater C_(max) but faster clearance.

For some therapies, such as protein replacement therapies that can be used to treat, e.g., LSDs, which include the use of ERT enzymes such as IDS (e.g., for the treatment of Hunter syndrome), as well as others, achieving a high brain C_(max) of the therapeutic is desired over the dosing window, as higher extracellular concentrations will in turn drive increased intracellular protein concentrations. Once delivered into cells, the intracellular half-life of the delivered protein is sustained for a longer period of time compared to plasma residence time. In addition, having a high C_(max) may be beneficial for enzyme replacement, as the high enzyme concentration can drive an increased rate of substrate turnover by the enzyme. For improving brain C_(max), using polypeptides and proteins that have a TfR affinity range of 50-250 nM is particularly useful.

II. Definitions

As used herein, the singular forms “a,” “an,” and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a polypeptide” may include two or more such molecules, and the like.

As used herein, the terms “about” and “approximately,” when used to modify an amount specified in a numeric value or range, indicate that the numeric value as well as reasonable deviations from the value known to the skilled person in the art, for example ±20%, ±10%, or ±5%, are within the intended meaning of the recited value.

An “enzyme replacement therapy enzyme” or “ERT enzyme” refers to an enzyme that is deficient in a lysosomal storage disorder. An “ERT enzyme variant” refers to a functional variant, including allelic and splice variants, of a wild-type ERT enzyme or a fragment thereof, where the ERT enzyme variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding wild-type ERT enzyme or fragment thereof, e.g., when assayed under identical conditions. A “catalytically active fragment” of an ERT enzyme refers to a portion of a full-length ERT enzyme or a variant thereof, where the catalytically active fragment has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding full-length ERT enzyme or variant thereof, e.g., when assayed under identical conditions.

An “iduronate sulfatase,” “iduronate-2-sulfatase,” or “IDS” as used herein refers to iduronate 2-sulfatase (EC 3.1.6.13), which is an enzyme involved in the lysosomal degradation of the glycosaminoglycans heparan sulfate and dermatan sulfate. Deficiency of IDS is associated with Mucopolysaccharidosis II, also known as Hunter syndrome. The term “IDS” as used herein as a component of a protein that comprises an Fc polypeptide is catalytically active and encompasses functional variants, including allelic and splice variants, of a wild-type IDS or a fragment thereof. The sequence of human IDS isoform I, which is the human sequence designated as the canonical sequence, is available under UniProt entry P22304 and is encoded by the human IDS gene at Xq28. The full-length sequence is provided as SEQ ID NO:91. A “mature” IDS sequence as used herein refers to a form of a polypeptide chain that lacks the signal and propeptide sequences of the naturally occurring full-length polypeptide chain. The amino acid sequence of a mature human IDS polypeptide is provided as SEQ ID NO:92, which corresponds to amino acids 34-550 of the full-length human sequence. A “truncated” IDS sequence as used herein refers to a catalytically active fragment of the naturally occurring full-length polypeptide chain. The amino acid sequence of an exemplary truncated human IDS polypeptide is provided as SEQ ID NO:114, which corresponds to amino acids 26-550 of the full-length human sequence. The structure of human IDS has been well-characterized. An illustrative structure is available under PDB accession code 5FQL. The structure is also described in Nat. Comm. 8:15786 doi: 10.1038/ncomms15786, 2017. Non-human primate IDS sequences have also been described, including chimpanzee (UniProt entry K7BKV4) and rhesus macaque (UniProt entry H9FTX2). A mouse IDS sequence is available under Uniprot entry Q08890. An IDS variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding wild-type IDS or fragment thereof, e.g., when assayed under identical conditions. A catalytically active IDS fragment has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding full-length IDS or variant thereof, e.g., when assayed under identical conditions.

A “sulfoglucosamine sulfohydrolase,” “N-sulfoglucosamine sulfohydrolase,” or “SGSH” as used herein refers to N-sulfoglucosamine sulfohydrolase (EC 3.10.1.1), which is an enzyme involved in the lysosomal degradation of heparan sulfate. Mutations in this gene are associated with Sanfilippo syndrome A, one type of the lysosomal storage disorder mucopolysaccaridosis III, which results from impaired degradation of heparan sulfate. The term “SGSH” as used herein as a component of a protein that comprises an Fc polypeptide is catalytically active and encompasses functional variants, including allelic and splice variants, of a wild-type SGSH or a fragment thereof. The sequence of human SGSH is available under UniProt entry P51688 and is encoded by the human SGSH gene at 17q25.3. The full-length sequence is provided as SEQ ID NO:119. A “mature” SGSH sequence as used herein refers to a form of a polypeptide chain that lacks the signal sequence of the naturally occurring full-length polypeptide chain. The amino acid sequence of a mature human SGSH polypeptide is provided as SEQ ID NO:120, which corresponds to amino acids 21-502 of the full-length human sequence. A “truncated” SGSH sequence as used herein refers to a catalytically active fragment of the naturally occurring full-length polypeptide chain. The structure of human SGSH has been well-characterized. An illustrative structure is available under PDB accession code 4MHX. Non-human primate SGSH sequences have also been described, including chimpanzee (UniProt entry K7C218). A mouse SGSH sequence is available under Uniprot entry Q9EQ08. An SGSH variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding wild-type SGSH or fragment thereof, e.g., when assayed under identical conditions. A catalytically active SGSH fragment has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding full-length SGSH or variant thereof, e.g., when assayed under identical conditions.

An “acid sphingomyelinase,” “sphingomyelin phosphodiesterase,” or “ASM” as used herein refers to sphingomyelin phosphodiesterase 1 (EC 3.1.4.12), which is a lysosomal enzyme that converts sphingomyelin to ceramide. Diseases associated with ASM deficiency include Niemann-Pick disease (e.g., Type A or Type B). The term “ASM” as used herein as a component of a protein that comprises an Fc polypeptide is catalytically active and encompasses functional variants, including allelic and splice variants, of a wild-type ASM or a fragment thereof. The sequence of human ASM isoform 1, which is the human sequence designated as the canonical sequence, is available under UniProt entry P17405 and is encoded by the human SMPD1 gene at 11p15.4. The full-length sequence is provided as SEQ ID NO:121. A “mature” ASM sequence as used herein refers to a form of a polypeptide chain that lacks the signal sequence of the naturally occurring full-length polypeptide chain. The amino acid sequence of a mature human ASM polypeptide is provided as SEQ ID NO:122, which corresponds to amino acids 47-629 of the full-length human sequence. A “truncated” ASM sequence as used herein refers to a catalytically active fragment of the naturally occurring full-length polypeptide chain. The amino acid sequence of an exemplary truncated human ASM polypeptide is provided as SEQ ID NO:123, which corresponds to amino acids 47-620 of the full-length human sequence. The structure of human ASM has been well-characterized. An illustrative structure is available under PDB accession code 5181. Non-human primate ASM sequences have also been described, including chimpanzee (UniProt entry H2Q319). A mouse ASM sequence is available under Uniprot entry Q04519. An ASM variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding wild-type ASM or fragment thereof, e.g., when assayed under identical conditions. A catalytically active ASM fragment has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding full-length ASM or variant thereof, e.g., when assayed under identical conditions.

A “β-glucocerebrosidase” or “GBA” is also known as glucosylceramidase (EC 3.2.1.45). The term as used herein refers to a lysosomal enzyme that has glucosylceramidase activity and catalyzes the breakdown of glucosylceramide to ceramide and glucose. Deficiency of GBA is associated with Gaucher's disease and Parkinson's disease. The term “GBA” as used herein as a component of a protein that comprises an Fc polypeptide is catalytically active and encompasses functional variants, including allelic and splice variants, of a wild-type GBA, or a fragment thereof. The sequence of human GBA, long isoform, which is designated as the canonical sequence, is available under UniProt entry P04062-1 and is encoded by the human GBA gene at 1q22. The full-length sequence is provided as SEQ ID NO:93. A “mature” GBA sequence as used herein refers to a form of a polypeptide chain that lacks the signal and propeptide sequences of the naturally occurring full-length polypeptide chain. The amino acid sequence of a mature human GBA polypeptide is provided as SEQ ID NO:94, which corresponds to amino acids 40-536 of the full-length human sequence. A “truncated” GBA sequence as used herein refers to a catalytically active fragment of the naturally occurring full-length polypeptide chain. The structure of human GBA has been well-characterized. Nearly 20 crystal structures of GBA are available. Non-human primate GBA sequences have also been described, including chimpanzee (UniProt entry Q9BDTO) and orangutan (UniProt entry Q5R8E3). A mouse GBA sequence is available under UniProt entry P17439. A GBA variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding wild-type GBA or fragment thereof, e.g., when assayed under identical conditions. A catalytically active GBA fragment has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% of the activity of the corresponding full-length GBA or variant thereof, e.g., when assayed under identical conditions.

A “transferrin receptor” or “TfR” as used herein refers to transferrin receptor protein 1. The human transferrin receptor 1 polypeptide sequence is set forth in SEQ ID NO:96. Transferrin receptor protein 1 sequences from other species are also known (e.g., chimpanzee, accession number XP_003310238.1; rhesus monkey, NP_001244232.1; dog, NP_001003111.1; cattle, NP_001193506.1; mouse, NP_035768.1; rat, NP_073203.1; and chicken, NP_990587.1). The term “transferrin receptor” also encompasses allelic variants of exemplary reference sequences, e.g., human sequences, that are encoded by a gene at a transferrin receptor protein 1 chromosomal locus. Full-length transferrin receptor protein includes a short N-terminal intracellular region, a transmembrane region, and a large extracellular domain. The extracellular domain is characterized by three domains: a protease-like domain, a helical domain, and an apical domain. The apical domain sequence of human transferrin receptor 1 is set forth in SEQ ID NO:238.

A “fusion protein” or “[ERT enzyme]-Fc fusion protein” as used herein refers to a dimeric protein comprising a first Fc polypeptide that is linked (e.g., fused) to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof (i.e., an “[ERT]-Fc fusion polypeptide”); and a second Fc polypeptide that forms an Fc dimer with the first Fc polypeptide. The second Fc polypeptide may also be linked (e.g., fused) to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof. The first Fc polypeptide and/or the second Fc polypeptide may be linked to the ERT enzyme, ERT enzyme variant, or catalytically active fragment thereof by a peptide bond or by a polypeptide linker. The first Fc polypeptide and/or the second Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that promote its heterodimerization to the other Fc polypeptide. The first Fc polypeptide and/or the second Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that confer binding to a transferrin receptor. The first Fc polypeptide and/or the second Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that reduce effector function. The first Fc polypeptide and/or the second Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that extend serum half-life.

A “fusion polypeptide” or “[ERT enzyme]-Fc fusion polypeptide” as used herein refers to an Fc polypeptide that is linked (e.g., fused) to an ERT enzyme, an ERT enzyme variant, or a catalytically active fragment thereof. The Fc polypeptide may be linked to the ERT enzyme, ERT enzyme variant, or catalytically active fragment thereof by a peptide bond or by a polypeptide linker. The Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that promote its heterodimerization to another Fc polypeptide. The Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that confer binding to a transferrin receptor. The Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that reduce effector function. The Fc polypeptide may be a modified Fc polypeptide that contains one or more modifications that extend serum half-life.

As used herein, the term “Fc polypeptide” refers to the C-terminal region of a naturally occurring immunoglobulin heavy chain polypeptide that is characterized by an Ig fold as a structural domain. An Fc polypeptide contains constant region sequences including at least the CH2 domain and/or the CH3 domain and may contain at least part of the hinge region. In general, an Fc polypeptide does not contain a variable region.

A “modified Fc polypeptide” refers to an Fc polypeptide that has at least one mutation, e.g., a substitution, deletion or insertion, as compared to a wild-type immunoglobulin heavy chain Fc polypeptide sequence, but retains the overall Ig fold or structure of the native Fc polypeptide.

The term “FcRn” refers to the neonatal Fc receptor. Binding of Fc polypeptides to FcRn reduces clearance and increases serum half-life of the Fc polypeptide. The human FcRn protein is a heterodimer that is composed of a protein of about 50 kDa in size that is similar to a major histocompatibility (MHC) class I protein and a β2-microglobulin of about 15 kDa in size.

As used herein, an “FcRn binding site” refers to the region of an Fc polypeptide that binds to FcRn. In human IgG, the FcRn binding site, as numbered using the EU index, includes T250, L251, M252, 1253, S254, R255, T256, T307, E380, M428, H433, N434, H435, and Y436. These positions correspond to positions 20 to 26, 77, 150, 198, and 203 to 206 of SEQ ID NO:1.

As used herein, a “native FcRn binding site” refers to a region of an Fc polypeptide that binds to FcRn and that has the same amino acid sequence as the region of a naturally occurring Fc polypeptide that binds to FcRn.

The terms “CH3 domain” and “CH2 domain” as used herein refer to immunoglobulin constant region domain polypeptides. For purposes of this application, a CH3 domain polypeptide refers to the segment of amino acids from about position 341 to about position 447 as numbered according to EU, and a CH2 domain polypeptide refers to the segment of amino acids from about position 231 to about position 340 as numbered according to the EU numbering scheme and does not include hinge region sequences. CH2 and CH3 domain polypeptides may also be numbered by the IMGT (ImMunoGeneTics) numbering scheme in which the CH2 domain numbering is 1-110 and the CH3 domain numbering is 1-107, according to the IMGT Scientific chart numbering (IMGT website). CH2 and CH3 domains are part of the Fc region of an immunoglobulin. An Fc region refers to the segment of amino acids from about position 231 to about position 447 as numbered according to the EU numbering scheme, but as used herein, can include at least a part of a hinge region of an antibody. An illustrative hinge region sequence is the human IgG1 hinge sequence EPKSCDKTHTCPPCP (SEQ ID NO:95).

The terms “wild-type,” “native,” and “naturally occurring” with respect to a CH3 or CH2 domain are used herein to refer to a domain that has a sequence that occurs in nature.

As used herein, the term “mutant” with respect to a mutant polypeptide or mutant polynucleotide is used interchangeably with “variant.” A variant with respect to a given wild-type CH3 or CH2 domain reference sequence can include naturally occurring allelic variants. A “non-naturally” occurring CH3 or CH2 domain refers to a variant or mutant domain that is not present in a cell in nature and that is produced by genetic modification, e.g., using genetic engineering technology or mutagenesis techniques, of a native CH3 domain or CH2 domain polynucleotide or polypeptide. A “variant” includes any domain comprising at least one amino acid mutation with respect to wild-type. Mutations may include substitutions, insertions, and deletions.

The term “amino acid” refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.

Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, γ-carboxyglutamate and 0-phosphoserine. “Amino acid analogs” refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. “Amino acid mimetics” refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that function in a manner similar to a naturally occurring amino acid.

Naturally occurring α-amino acids include, without limitation, alanine (Ala), cysteine (Cys), aspartic acid (Asp), glutamic acid (Glu), phenylalanine (Phe), glycine (Gly), histidine (His), isoleucine (Ile), arginine (Arg), lysine (Lys), leucine (Leu), methionine (Met), asparagine (Asn), proline (Pro), glutamine (Gln), serine (Ser), threonine (Thr), valine (Val), tryptophan (Trp), tyrosine (Tyr), and combinations thereof. Stereoisomers of a naturally-occurring α-amino acids include, without limitation, D-alanine (D-Ala), D-cysteine (D-Cys), D-aspartic acid (D-Asp), D-glutamic acid (D-Glu), D-phenylalanine (D-Phe), D-histidine (D-His), D-isoleucine (D-Ile), D-arginine (D-Arg), D-lysine (D-Lys), D-leucine (D-Leu), D-methionine (D-Met), D-asparagine (D-Asn), D-proline (D-Pro), D-glutamine (D-Gln), D-serine (D-Ser), D-threonine (D-Thr), D-valine (D-Val), D-tryptophan (D-Trp), D-tyrosine (D-Tyr), and combinations thereof.

Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission.

The terms “polypeptide” and “peptide” are used interchangeably herein to refer to a polymer of amino acid residues in a single chain. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymers. Amino acid polymers may comprise entirely L-amino acids, entirely D-amino acids, or a mixture of L and D amino acids.

The term “protein” as used herein refers to either a polypeptide or a dimer (i.e, two) or multimer (i.e., three or more) of single chain polypeptides. The single chain polypeptides of a protein may be joined by a covalent bond, e.g., a disulfide bond, or non-covalent interactions.

The term “conservative substitution,” “conservative mutation,” or “conservatively modified variant” refers to an alteration that results in the substitution of an amino acid with another amino acid that can be categorized as having a similar feature. Examples of categories of conservative amino acid groups defined in this manner can include: a “charged/polar group” including Glu (Glutamic acid or E), Asp (Aspartic acid or D), Asn (Asparagine or N), Gln (Glutamine or Q), Lys (Lysine or K), Arg (Arginine or R), and His (Histidine or H); an “aromatic group” including Phe (Phenylalanine or F), Tyr (Tyrosine or Y), Trp (Tryptophan or W), and (Histidine or H); and an “aliphatic group” including Gly (Glycine or G), Ala (Alanine or A), Val (Valine or V), Leu (Leucine or L), Ile (Isoleucine or I), Met (Methionine or M), Ser (Serine or S), Thr (Threonine or T), and Cys (Cysteine or C). Within each group, subgroups can also be identified. For example, the group of charged or polar amino acids can be sub-divided into sub-groups including: a “positively-charged sub-group” comprising Lys, Arg and His; a “negatively-charged sub-group” comprising Glu and Asp; and a “polar sub-group” comprising Asn and Gln. In another example, the aromatic or cyclic group can be sub-divided into sub-groups including: a “nitrogen ring sub-group” comprising Pro, His and Trp; and a “phenyl sub-group” comprising Phe and Tyr. In another further example, the aliphatic group can be sub-divided into sub-groups, e.g., an “aliphatic non-polar sub-group” comprising Val, Leu, Gly, and Ala; and an “aliphatic slightly-polar sub-group” comprising Met, Ser, Thr, and Cys. Examples of categories of conservative mutations include amino acid substitutions of amino acids within the sub-groups above, such as, but not limited to: Lys for Arg or vice versa, such that a positive charge can be maintained; Glu for Asp or vice versa, such that a negative charge can be maintained; Ser for Thr or vice versa, such that a free —OH can be maintained; and Gln for Asn or vice versa, such that a free —NH₂ can be maintained. In some embodiments, hydrophobic amino acids are substituted for naturally occurring hydrophobic amino acid, e.g., in the active site, to preserve hydrophobicity.

The terms “identical” or percent “identity,” in the context of two or more polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues, e.g., at least 60% identity, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or greater, that are identical over a specified region when compared and aligned for maximum correspondence over a comparison window, or designated region, as measured using a sequence comparison algorithm or by manual alignment and visual inspection.

For sequence comparison of polypeptides, typically one amino acid sequence acts as a reference sequence, to which a candidate sequence is compared. Alignment can be performed using various methods available to one of skill in the art, e.g., visual alignment or using publicly available software using known algorithms to achieve maximal alignment. Such programs include the BLAST programs, ALIGN, ALIGN-2 (Genentech, South San Francisco, Calif.) or Megalign (DNASTAR). The parameters employed for an alignment to achieve maximal alignment can be determined by one of skill in the art. For sequence comparison of polypeptide sequences for purposes of this application, the BLASTP algorithm standard protein BLAST for aligning two proteins sequence with the default parameters is used.

The terms “corresponding to,” “determined with reference to,” or “numbered with reference to” when used in the context of the identification of a given amino acid residue in a polypeptide sequence, refers to the position of the residue of a specified reference sequence when the given amino acid sequence is maximally aligned and compared to the reference sequence. Thus, for example, an amino acid residue in a modified Fc polypeptide “corresponds to” an amino acid in SEQ ID NO:1, when the residue aligns with the amino acid in SEQ ID NO:1 when optimally aligned to SEQ ID NO: 1. The polypeptide that is aligned to the reference sequence need not be the same length as the reference sequence.

A “binding affinity” as used herein refers to the strength of the non-covalent interaction between two molecules, e.g., a single binding site on a polypeptide and a target, e.g., transferrin receptor, to which it binds. Thus, for example, the term may refer to 1:1 interactions between a polypeptide and its target, unless otherwise indicated or clear from context. Binding affinity may be quantified by measuring an equilibrium dissociation constant (K_(D)), which refers to the dissociation rate constant (k_(d), time⁻¹) divided by the association rate constant (k_(a), time⁻¹ M⁻¹). K_(D) can be determined by measurement of the kinetics of complex formation and dissociation, e.g., using Surface Plasmon Resonance (SPR) methods, e.g., a Biacore™ system; kinetic exclusion assays such as KinExA®; and BioLayer interferometry (e.g., using the ForteBio® Octet® platform). As used herein, “binding affinity” includes not only formal binding affinities, such as those reflecting 1:1 interactions between a polypeptide and its target, but also apparent affinities for which K_(D)'s are calculated that may reflect avid binding.

As used herein, the term “specifically binds” or “selectively binds” to a target, e.g., TfR, when referring to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody as described herein, refers to a binding reaction whereby the engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody binds to the target with greater affinity, greater avidity, and/or greater duration than it binds to a structurally different target. In typical embodiments, the engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody has at least 5-fold, 10-fold, 50-fold, 100-fold, 1,000-fold, 10,000-fold, or greater affinity for a specific target, e.g., TfR, compared to an unrelated target when assayed under the same affinity assay conditions. The term “specific binding,” “specifically binds to,” or “is specific for” a particular target (e.g., TfR), as used herein, can be exhibited, for example, by a molecule having an equilibrium dissociation constant K_(D) for the target to which it binds of, e.g., 10⁻⁴ M or smaller, e.g., 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷M, 10⁻⁸M, 10⁻⁹M, 10⁻¹⁰ M, 10⁻¹¹M, or 10⁻¹² M. In some embodiments, an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody specifically binds to an epitope on TfR that is conserved among species, (e.g., structurally conserved among species), e.g., conserved between non-human primate and human species (e.g., structurally conserved between non-human primate and human species). In some embodiments, an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody may bind exclusively to a human TfR.

The term “variable region” or “variable domain” refers to a domain in an antibody heavy chain or light chain that is derived from a germline Variable (V) gene, Diversity (D) gene, or Joining (J) gene (and not derived from a Constant (Cμ and Cδ) gene segment), and that gives an antibody its specificity for binding to an antigen. Typically, an antibody variable region comprises four conserved “framework” regions interspersed with three hypervariable “complementarity determining regions.”

The terms “antigen-binding portion” and “antigen-binding fragment” are used interchangeably herein and refer to one or more fragments of an antibody that retains the ability to specifically bind to an antigen via its variable region. Examples of antigen-binding fragments include, but are not limited to, a Fab fragment (a monovalent fragment consisting of the VL, VH, CL, and CH1 domains), a F(ab′)2 fragment (a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region), a single chain Fv (scFv), a disulfide-linked Fv (dsFv), complementarity determining regions (CDRs), a VL (light chain variable region), and a VH (heavy chain variable region).

The terms “treatment,” “treating,” and the like are used herein to generally mean obtaining a desired pharmacologic and/or physiologic effect. “Treating” or “treatment” may refer to any indicia of success in the treatment or amelioration of a lysosomal storage disorder, e.g., Hunter syndrome, Sanfilippo syndrome A, Niemann-Pick disease, Gaucher's disease, or Parkinson's disease, including any objective or subjective parameter such as abatement, remission, improvement in patient survival, increase in survival time or rate, diminishing of symptoms or making the disorder more tolerable to the patient, slowing in the rate of degeneration or decline, or improving a patient's physical or mental well-being. The treatment or amelioration of symptoms can be based on objective or subjective parameters. The effect of treatment can be compared to an individual or pool of individuals not receiving the treatment, or to the same patient prior to treatment or at a different time during treatment.

The term “subject,” “individual,” and “patient,” as used interchangeably herein, refer to a mammal, including but not limited to humans, non-human primates, rodents (e.g., rats, mice, and guinea pigs), rabbits, cows, pigs, horses, and other mammalian species. In one embodiment, the patient is a human.

The term “pharmaceutically acceptable excipient” refers to a non-active pharmaceutical ingredient that is biologically or pharmacologically compatible for use in humans or animals, such as but not limited to a buffer, carrier, or preservative.

As used herein, a “therapeutic amount,” “therapeutically effective amount,” or “therapeutically effective concentration” of an agent is an amount or concentration of the agent that treats signs or symptoms of a disease (e.g., an LSD) in the subject (e.g., mammal).

The term “administer” refers to a method of delivering agents, compounds, or compositions to the desired site of biological action. These methods include, but are not limited to, topical delivery, parenteral delivery, intravenous delivery, intradermal delivery, intramuscular delivery, intrathecal delivery, colonic delivery, rectal delivery, or intraperitoneal delivery. In one embodiment, the polypeptides described herein are administered intravenously.

III. Enzyme Replacement Therapy (ERT) Enzymes

Lysosomal storage disorders (LSDs) are inherited metabolic diseases characterized by the accumulation of undigested or partially digested macromolecules, which ultimately results in cellular dysfunction and clinical abnormalities. Classically, LSDs have been defined as deficiencies in lysosomal function generally classified by the accumulated substrate and include sphingolipidoses, oligosaccharidoses, mucolipidoses, mucopolysaccharidoses, lipoprotein storage disorders, neuronal ceroid lipofuscinoses, and others. The classification of these disorders has recently been expanded to include other deficiencies or defects in proteins that result in accumulation of macromolecules, such as proteins necessary for normal post-translational modification of lysosomal enzymes, or proteins important for proper lysosomal trafficking.

In some aspects, a fusion protein described herein comprises: (i) an Fc polypeptide, which may contain modifications (e.g., one or more modifications that promote heterodimerization) or may be a wild-type Fc polypeptide; and an ERT enzyme; and (ii) an Fc polypeptide, which may contain modifications (e.g., one or more modifications that promote heterodimerization) or may be a wild-type Fc polypeptide; and optionally an ERT enzyme. In some embodiments, one or both Fc polypeptides may contain modifications that result in binding to a blood-brain barrier (BBB) receptor, e.g., a transferrin receptor (TfR). The ERT enzyme may be any enzyme that is deficient in an LSD. An ERT enzyme incorporated into the fusion protein is catalytically active, i.e., it retains the enzymatic activity that is deficient in the LSD. In some embodiments, the ERT enzyme is iduronate 2-sulfatase (IDS), which is deficient in Hunter syndrome. In some embodiments, the ERT enzyme is N-sulfoglucosamine sulfohydrolase (SGSH), which is deficient in Sanfilippo syndrome. In some embodiments, the ERT enzyme is acid sphingomyelinase (ASM), which is deficient in Niemann-Pick disease. In some embodiments, the ERT enzyme is β-glucocerebrosidase (GBA), which is deficient in Gaucher's disease and Parkinson's disease.

In some embodiments, a fusion protein comprising an ERT enzyme and optionally a modified Fc polypeptide that binds to a BBB receptor, e.g., a TfR-binding Fc polypeptide, comprises a catalytically active fragment or variant of a wild-type IDS. In some embodiments, the IDS enzyme is a variant or a catalytically active fragment of an IDS protein that comprises the amino acid sequence of any one of SEQ ID NOS:91, 92, 114, 230, and 234. In some embodiments, a catalytically active variant or fragment of an IDS enzyme has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or greater of the activity of the wild-type IDS enzyme.

In some embodiments, a fusion protein comprising an ERT enzyme and optionally a modified Fc polypeptide that binds to a BBB receptor, e.g., a TfR-binding Fc polypeptide, comprises a catalytically active fragment or variant of a wild-type SGSH. In some embodiments, the SGSH enzyme is a variant or a catalytically active fragment of an SGSH protein that comprises the amino acid sequence of any one of SEQ ID NOS:119 and 120. In some embodiments, a catalytically active variant or fragment of an SGSH enzyme has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or greater of the activity of the wild-type SGSH enzyme.

In some embodiments, a fusion protein comprising an ERT enzyme and optionally a modified Fc polypeptide that binds to a BBB receptor, e.g., a TfR-binding Fc polypeptide, comprises a catalytically active fragment or variant of a wild-type ASM. In some embodiments, the ASM enzyme is a variant or a catalytically active fragment of an ASM protein that comprises the amino acid sequence of any one of SEQ ID NOS:121, 122, and 123. In some embodiments, a catalytically active variant or fragment of an ASM enzyme has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or greater of the activity of the wild-type ASM enzyme.

In some embodiments, a fusion protein comprising an ERT enzyme and optionally a modified Fc polypeptide that binds to a BBB receptor, e.g., a TfR-binding Fc polypeptide, comprises a catalytically active fragment or variant of a wild-type GBA. In some embodiments, the GBA enzyme is a variant or a catalytically active fragment of a GBA protein that comprises the amino acid sequence of any one of SEQ ID NOS:93 and 94. In some embodiments, a catalytically active variant or fragment of a GBA enzyme has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or greater of the activity of the wild-type GBA enzyme.

In some embodiments, an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, or a catalytically active variant or fragment thereof, that is present in a fusion protein described herein, retains at least 25% of its activity compared to its activity when not joined to an Fc polypeptide or a TfR-binding Fc polypeptide. In some embodiments, an ERT enzyme, or a catalytically active variant or fragment thereof, retains at least 10%, or at least 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, of its activity compared to its activity when not joined to an Fc polypeptide or a TfR-binding Fc polypeptide. In some embodiments, an ERT enzyme, or a catalytically active variant or fragment thereof, retains at least 80%, 85%, 90%, or 95% of its activity compared to its activity when not joined to an Fc polypeptide or a TfR-binding Fc polypeptide. In some embodiments, fusion to an Fc polypeptide does not decrease the activity of the ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, or catalytically active variant or fragment thereof. In some embodiments, fusion to a TfR-binding Fc polypeptide does not decrease the activity of the ERT enzyme.

IV. FC Polypeptide Modifications for Blood-Brain Barrier (BBB) Receptor Binding

In some aspects, provided herein are fusion proteins that are capable of being transported across the blood-brain barrier (BBB). Such a protein comprises a modified Fc polypeptide that binds to a BBB receptor. BBB receptors are expressed on BBB endothelia, as well as other cell and tissue types. In some embodiments, the BBB receptor is transferrin receptor (TfR).

Amino acid residues designated in various Fc modifications, including those introduced in a modified Fc polypeptide that binds to a BBB receptor, e.g., TfR, are numbered herein using EU index numbering. Any Fc polypeptide, e.g., an IgG1, IgG2, IgG3, or IgG4 Fc polypeptide, may have modifications, e.g., amino acid substitutions, in one or more positions as described herein.

A modified (e.g., enhancing heterodimerization and/or BBB receptor-binding) Fc polypeptide present in a fusion protein described herein can have at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to a native Fc region sequence or a fragment thereof, e.g., a fragment of at least 50 amino acids or at least 100 amino acids, or greater in length. In some embodiments, the native Fc amino acid sequence is the Fc region sequence of SEQ ID NO:1. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 1-110 of SEQ ID NO:1, or to amino acids 111-217 of SEQ ID NO:1, or a fragment thereof, e.g., a fragment of at least 50 amino acids or at least 100 amino acids, or greater in length.

In some embodiments, a modified (e.g., enhancing heterodimerization and/or BBB receptor-binding) Fc polypeptide comprises at least 50 amino acids, or at least 60, 65, 70, 75, 80, 85, 90, or 95 or more, or at least 100 amino acids, or more, that correspond to a native Fc region amino acid sequence. In some embodiments, the modified Fc polypeptide comprises at least 25 contiguous amino acids, or at least 30, 35, 40, or 45 contiguous amino acids, or 50 contiguous amino acids, or at least 60, 65, 70, 75, 80 85, 90, or 95 or more contiguous amino acids, or 100 or more contiguous amino acids, that correspond to a native Fc region amino acid sequence, such as SEQ ID NO:1.

In some embodiments, the domain that is modified for BBB receptor-binding activity is a human Ig CH3 domain, such as an IgG1 CH3 domain. The CH3 domain can be of any IgG subtype, i.e., from IgG1, IgG2, IgG3, or IgG4. In the context of IgG1 antibodies, a CH3 domain refers to the segment of amino acids from about position 341 to about position 447 as numbered according to the EU numbering scheme.

In some embodiments, the domain that is modified for BBB receptor-binding activity is a human Ig CH2 domain, such as an IgG CH2 domain. The CH2 domain can be of any IgG subtype, i.e., from IgG1, IgG2, IgG3, or IgG4. In the context of IgG1 antibodies, a CH2 domain refers to the segment of amino acids from about position 231 to about position 340 as numbered according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four five, six, seven, eight, nine, or ten substitutions at amino acid positions comprising 266, 267, 268, 269, 270, 271, 295, 297, 298, and 299, according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four, five, six, seven, eight, or nine substitutions at amino acid positions comprising 274, 276, 283, 285, 286, 287, 288, 289, and 290, according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four, five, six, seven, eight, nine, or ten substitutions at amino acid positions comprising 268, 269, 270, 271, 272, 292, 293, 294, 296, and 300, according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four, five, six, seven, eight, or nine substitutions at amino acid positions comprising 272, 274, 276, 322, 324, 326, 329, 330, and 331, according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four, five, six, or seven substitutions at amino acid positions comprising 345, 346, 347, 349, 437, 438, 439, and 440, according to the EU numbering scheme.

In some embodiments, a modified (e.g., BBB receptor-binding) Fc polypeptide present in a fusion protein described herein comprises at least one, two, or three substitutions; and in some embodiments, at least four, five, six, seven, eight, or nine substitutions at amino acid positions 384, 386, 387, 388, 389, 390, 413, 416, and 421, according to the EU numbering scheme.

FcRn Binding Sites

In certain aspects, modified (e.g., BBB receptor-binding) Fc polypeptides, or Fc polypeptides present in a fusion protein described herein that do not specifically bind to a BBB receptor, can also comprise an FcRn binding site. In some embodiments, the FcRn binding site is within the Fc polypeptide or a fragment thereof.

In some embodiments, the FcRn binding site comprises a native FcRn binding site. In some embodiments, the FcRn binding site does not comprise amino acid changes relative to the amino acid sequence of a native FcRn binding site. In some embodiments, the native FcRn binding site is an IgG binding site, e.g., a human IgG binding site. In some embodiments, the FcRn binding site comprises a modification that alters FcRn binding.

In some embodiments, an FcRn binding site has one or more amino acid residues that are mutated, e.g., substituted, wherein the mutation(s) increase serum half-life or do not substantially reduce serum half-life (i.e., reduce serum half-life by no more than 25% compared to a counterpart modified Fc polypeptide having the wild-type residues at the mutated positions when assayed under the same conditions). In some embodiments, an FcRn binding site has one or more amino acid residues that are substituted at positions 250-256, 307, 380, 428, and 433-436, according to the EU numbering scheme.

In some embodiments, one or more residues at or near an FcRn binding site are mutated, relative to a native human IgG sequence, to extend serum half-life of the modified polypeptide. In some embodiments, mutations are introduced into one, two, or three of positions 252, 254, and 256. In some embodiments, the mutations are M252Y, S254T, and T256E. In some embodiments, a modified Fc polypeptide further comprises the mutations M252Y, S254T, and T256E. In some embodiments, a modified Fc polypeptide comprises a substitution at one, two, or all three of positions T307, E380, and N434, according to the EU numbering scheme. In some embodiments, the mutations are T307Q and N434A. In some embodiments, a modified Fc polypeptide comprises mutations T307A, E380A, and N434A. In some embodiments, a modified Fc polypeptide comprises substitutions at positions T250 and M428, according to the EU numbering scheme. In some embodiments, the modified Fc polypeptide comprises mutations T250Q and/or M428L. In some embodiments, a modified Fc polypeptide comprises substitutions at positions M428 and N434, according to the EU numbering scheme. In some embodiments, the modified Fc polypeptide comprises mutations M428L and N434S. In some embodiments, a modified Fc polypeptide comprises an N434S or N434A mutation.

V. Transferrin Receptor-Binding Fc Polypeptides

This section describes generation of modified Fc polypeptides described herein that bind to transferrin receptor (TfR) and are capable of being transported across the blood-brain barrier (BBB).

TfR-Binding Fc Polypeptides Comprising Mutations in the CH3 Domain

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises substitutions in a CH3 domain. In some embodiments, a modified Fc polypeptide comprises a human Ig CH3 domain, such as an IgG CH3 domain, that is modified for TfR-binding activity. The CH3 domain can be of any IgG subtype, i.e., from IgG1, IgG2, IgG3, or IgG4. In the context of IgG antibodies, a CH3 domain refers to the segment of amino acids from about position 341 to about position 447 as numbered according to the EU numbering scheme.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR binds to the apical domain of TfR and may bind to TfR without blocking or otherwise inhibiting binding of transferrin to TfR. In some embodiments, binding of transferrin to TfR is not substantially inhibited. In some embodiments, binding of transferrin to TfR is inhibited by less than about 50% (e.g., less than about 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5%). In some embodiments, binding of transferrin to TfR is inhibited by less than about 20% (e.g., less than about 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1%).

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises at least two, three, four, five, six, seven, eight, or nine substitutions at positions 384, 386, 387, 388, 389, 390, 413, 416, and 421, according to the EU numbering scheme. Illustrative substitutions that may be introduced at these positions are shown in Tables 4 and 5. In some embodiments, the amino acid at position 388 and/or 421 is an aromatic amino acid, e.g., Trp, Phe, or Tyr. In some embodiments, the amino acid at position 388 is Trp. In some embodiments, the aromatic amino acid at position 421 is Trp or Phe.

In some embodiments, at least one position as follows is substituted: Leu, Tyr, Met, or Val at position 384; Leu, Thr, His, or Pro at position 386; Val, Pro, or an acidic amino acid at position 387; an aromatic amino acid, e.g., Trp at position 388; Val, Ser, or Ala at position 389; an acidic amino acid, Ala, Ser, Leu, Thr, or Pro at position 413; Thr or an acidic amino acid at position 416; or Trp, Tyr, His, or Phe at position 421. In some embodiments, the modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set. Thus, for example, Ile may be present at position 384, 386, and/or position 413. In some embodiments, the acidic amino acid at position one, two, or each of positions 387, 413, and 416 is Glu. In other embodiments, the acidic amino acid at one, two or each of positions 387, 413, and 416 is Asp. In some embodiments, two, three, four, five, six, seven, or all eight of positions 384, 386, 387, 388, 389, 413, 416, and 421 have an amino acid substitution as specified in this paragraph.

In some embodiments, an Fc polypeptide that is modified as described in the preceding two paragraphs comprises a native Asn at position 390. In some embodiments, the modified Fc polypeptide comprises Gly, His, Gln, Leu, Lys, Val, Phe, Ser, Ala, or Asp at position 390. In some embodiments, the modified Fc polypeptide further comprises one, two, three, or four substitutions at positions comprising 380, 391, 392, and 415, according to the EU numbering scheme. In some embodiments, Trp, Tyr, Leu, or Gln may be present at position 380. In some embodiments, Ser, Thr, Gln, or Phe may be present at position 391. In some embodiments, Gln, Phe, or His may be present at position 392. In some embodiments, Glu may be present at position 415.

In certain embodiments, the modified Fc polypeptide comprises two, three, four, five, six, seven, eight, nine, ten, or eleven positions selected from the following: Trp, Leu, or Glu at position 380; Tyr or Phe at position 384; Thr at position 386; Glu at position 387; Trp at position 388; Ser, Ala, Val, or Asn at position 389; Ser or Asn at position 390; Thr or Ser at position 413; Glu or Ser at position 415; Glu at position 416; and/or Phe at position 421. In some embodiments, the modified Fc polypeptide comprises all eleven positions as follows: Trp, Leu, or Glu at position 380; Tyr or Phe at position 384; Thr at position 386; Glu at position 387; Trp at position 388; Ser, Ala, Val, or Asn at position 389; Ser or Asn at position 390; Thr or Ser at position 413; Glu or Ser at position 415; Glu at position 416; and/or Phe at position 421.

In certain embodiments, the modified Fc polypeptide comprises Leu or Met at position 384; Leu, His, or Pro at position 386; Val at position 387; Trp at position 388; Val or Ala at position 389; Pro at position 413; Thr at position 416; and/or Trp at position 421. In some embodiments, the modified Fc polypeptide further comprises Ser, Thr, Gln, or Phe at position 391. In some embodiments, the modified Fc polypeptide further comprises Trp, Tyr, Leu, or Gln at position 380 and/or Gln, Phe, or His at position 392. In some embodiments, Trp is present at position 380 and/or Gln is present at position 392. In some embodiments, the modified Fc polypeptide does not have a Trp at position 380.

In other embodiments, the modified Fc polypeptide comprises Tyr at position 384; Thr at position 386; Glu or Val and position 387; Trp at position 388; Ser at position 389; Ser or Thr at position 413; Glu at position 416; and/or Phe at position 421. In some embodiments, the modified Fc polypeptide comprises a native Asn at position 390. In certain embodiments, the modified Fc polypeptide further comprises Trp, Tyr, Leu, or Gln at position 380; and/or Glu at position 415. In some embodiments, the modified Fc polypeptide further comprises Trp at position 380 and/or Glu at position 415.

In additional embodiments, the modified Fc polypeptide further comprises one, two, or three substitutions at positions comprising 414, 424, and 426, according to the EU numbering scheme. In some embodiments, position 414 is Lys, Arg, Gly, or Pro; position 424 is Ser, Thr, Glu, or Lys; and/or position 426 is Ser, Trp, or Gly.

In some embodiments, the modified Fc polypeptide comprises one or more of the following substitutions: Trp at position 380; Thr at position 386; Trp at position 388; Val at position 389; Thr or Ser at position 413; Glu at position 415; and/or Phe at position 421, according to the EU numbering scheme.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 111-217 of any one of SEQ ID NOS:4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90). In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to any one of SEQ ID NOS:4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90). In some embodiments, the modified Fc polypeptide comprises the amino acids at EU index positions 384-390 and/or 413-421 of any one of SEQ ID NOS:4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90). In some embodiments, the modified Fc polypeptide comprises the amino acids at EU index positions 380-390 and/or 413-421 of any one of 4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90). In some embodiments, the modified Fc polypeptide comprises the amino acids at EU index positions 380-392 and/or 413-426 of any one of SEQ ID NOS:4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90).

In some embodiments, the modified Fc polypeptide has at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to any one of SEQ ID NOS:4-90, 97-100, and 105-108 (e.g., SEQ ID NOS:34-38, 58, and 60-90), and further comprises at least five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, or sixteen of the positions, numbered according to the EU index, as follows: Trp, Tyr, Leu, Gln, or Glu at position 380; Leu, Tyr, Met, or Val at position 384; Leu, Thr, His, or Pro at position 386; Val, Pro, or an acidic amino acid at position 387; an aromatic amino acid, e.g., Trp, at position 388; Val, Ser, or Ala at position 389; Ser or Asn at position 390; Ser, Thr, Gln, or Phe at position 391; Gln, Phe, or His at position 392; an acidic amino acid, Ala, Ser, Leu, Thr, or Pro at position 413; Lys, Arg, Gly or Pro at position 414; Glu or Ser at position 415; Thr or an acidic amino acid at position 416; Trp, Tyr, His or Phe at position 421; Ser, Thr, Glu or Lys at position 424; and Ser, Trp, or Gly at position 426.

In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:34-38, 58, and 60-90. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:34-38, 58, and 60-90, but in which one, two, or three amino acids are substituted.

In some embodiments, the modified Fc polypeptide comprises additional mutations such as the mutations described in Section VI below, including, but not limited to, a knob mutation (e.g., T366W as numbered with reference to EU numbering), hole mutations (e.g., T366S, L368A, and Y407V as numbered with reference to EU numbering), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered with reference to EU numbering), and/or mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme). By way of illustration, SEQ ID NOS:156-229 provide non-limiting examples of modified Fc polypeptides with mutations in the CH3 domain (e.g., clones CH3C.35.20.1, CH3C.35.23.2, CH3C.35.23.3, CH3C.35.23.4, CH3C.35.21.17.2, and CH3C.35.23) comprising one or more of these additional mutations.

In some embodiments, the modified Fc polypeptide comprises a knob mutation (e.g., T366W as numbered with reference to EU numbering) and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:156, 168, 180, 192, 204, and 216. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:156, 168, 180, 192, 204, and 216.

In some embodiments, the modified Fc polypeptide comprises a knob mutation (e.g., T366W as numbered with reference to EU numbering) and mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered with reference to EU numbering), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:157, 158, 169, 170, 181, 182, 193, 194, 205, 206, 217, 218, 228, and 229. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:157, 158, 169, 170, 181, 182, 193, 194, 205, 206, 217, and 218.

In some embodiments, the modified Fc polypeptide comprises a knob mutation (e.g., T366W as numbered with reference to EU numbering) and mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:159, 171, 183, 195, 207, and 219. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:159, 171, 183, 195, 207, and 219.

In some embodiments, the modified Fc polypeptide comprises a knob mutation (e.g., T366W as numbered with reference to EU numbering), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered with reference to EU numbering), and mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:160, 161, 172, 173, 184, 185, 196, 197, 208, 209, 220, and 221. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:160, 161, 172, 173, 184, 185, 196, 197, 208, 209, 220, and 221.

In some embodiments, the modified Fc polypeptide comprises hole mutations (e.g., T366S, L368A, and Y407V as numbered with reference to EU numbering) and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:162, 174, 186, 198, 210, and 222. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:162, 174, 186, 198, 210, and 222.

In some embodiments, the modified Fc polypeptide comprises hole mutations (e.g., T366S, L368A, and Y407V as numbered with reference to EU numbering) and mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered with reference to EU numbering), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:163, 164, 175, 176, 187, 188, 199, 200, 211, 212, 223, and 224. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:163, 164, 175, 176, 187, 188, 199, 200, 211, 212, 223, and 224.

In some embodiments, the modified Fc polypeptide comprises hole mutations (e.g., T366S, L368A, and Y407V as numbered with reference to EU numbering) and mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:165, 177, 189, 201, 213, and 225. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:165, 177, 189, 201, 213, and 225.

In some embodiments, the modified Fc polypeptide comprises hole mutations (e.g., T366S, L368A, and Y407V as numbered with reference to EU numbering), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered with reference to EU numbering), and mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and has at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:166, 167, 178, 179, 190, 191, 202, 203, 214, 215, 226, and 227. In some embodiments, the modified Fc polypeptide comprises the sequence of any one of SEQ ID NOS:166, 167, 178, 179, 190, 191, 202, 203, 214, 215, 226, and 227.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises at least two, three, four, five, six, seven, or eight substitutions at positions 345, 346, 347, 349, 437, 438, 439, and 440, according to the EU numbering scheme. Illustrative modified Fc polypeptides are provided in SEQ ID NOS:124-128. In some embodiments, the modified Fc polypeptide comprises Gly at position 437; Phe at position 438; and/or Asp at position 440. In some embodiments, Glu is present at position 440. In certain embodiments, the modified Fc polypeptide comprises at least one substitution at a position as follows: Phe or Ile at position 345; Asp, Glu, Gly, Ala, or Lys at position 346; Tyr, Met, Leu, Ile, or Asp at position 347; Thr or Ala at position 349; Gly at position 437; Phe at position 438; His Tyr, Ser, or Phe at position 439; or Asp at position 440. In some embodiments, two, three, four, five, six, seven, or all eight of positions 345, 346, 347, 349, 437, 438, 439, and 440 and have a substitution as specified in this paragraph. In some embodiments, the modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 111-217 of any one of SEQ ID NOS:124-128. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to SEQ ID NOS:124-128. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:124-128. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:124-128, but in which one, two, or three amino acids are substituted.

TfR-Binding Fc Polypeptides Comprising Mutations in the CH2 Domain

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises substitutions in a CH2 domain. In some embodiments, a modified Fc polypeptide comprises a human Ig CH2 domain, such as an IgG CH2 domain, that is modified for TfR-binding activity. The CH2 domain can be of any IgG subtype, i.e., from IgG1, IgG2, IgG3, or IgG4. In the context of IgG antibodies, a CH2 domain refers to the segment of amino acids from about position 231 to about position 340 as numbered according to the EU numbering scheme.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR binds to the apical domain of TfR and may bind to TfR without blocking or otherwise inhibiting binding of transferrin to TfR. In some embodiments, binding of transferrin to TfR is not substantially inhibited. In some embodiments, binding of transferrin to TfR is inhibited by less than about 50% (e.g., less than about 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5%). In some embodiments, binding of transferrin to TfR is inhibited by less than about 20% (e.g., less than about 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1%).

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises at least two, three, four, five, six, seven, eight, or nine substitutions at positions 274, 276, 283, 285, 286, 287, 288, and 290, according to the EU numbering scheme. Illustrative modified Fc polypeptides are provided in SEQ ID NOS:129-133. In some embodiments, the modified Fc polypeptide comprises Glu at position 287 and/or Trp at position 288. In some embodiments, the modified Fc polypeptide comprises at least one substitution at a position as follows: Glu, Gly, Gln, Ser, Ala, Asn, Tyr, or Trp at position 274; Ile, Val, Asp, Glu, Thr, Ala, or Tyr at position 276; Asp, Pro, Met, Leu, Ala, Asn, or Phe at position 283; Arg, Ser, Ala, or Gly at position 285; Tyr, Trp, Arg, or Val at position 286; Glu at position 287; Trp or Tyr at position 288; Gln, Tyr, His, Ile, Phe, Val, or Asp at position 289; or Leu, Trp, Arg, Asn, Tyr, or Val at position 290. In some embodiments, two, three, four, five, six, seven, eight, or all nine of positions 274, 276, 283, 285, 286, 287, 288, and 290 have a substitution as specified in this paragraph. In some embodiments, the modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set.

In some embodiments, the modified Fc polypeptide comprises Glu, Gly, Gln, Ser, Ala, Asn, or Tyr at position 274; Ile, Val, Asp, Glu, Thr, Ala, or Tyr at position 276 Asp, Pro, Met, Leu, Ala, or Asn at position 283; Arg, Ser, or Ala at position 285; Tyr, Trp, Arg, or Val at position 286; Glu at position 287; Trp at position 288; Gln, Tyr, His, Ile, Phe, or Val at position 289; and/or Leu, Trp, Arg, Asn, or Tyr at position 290. In some embodiments, the modified Fc polypeptide comprises Arg at position 285; Tyr or Trp at position 286; Glu at position 287; Trp at position 288; and/or Arg or Trp at position 290.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 1-110 of any one of SEQ ID NOS:129-133. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to SEQ ID NOS:129-133. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:129-133. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:129-133, but in which one, two, or three amino acids are substituted.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises at least two, three, four, five, six, seven, eight, nine, or ten substitutions at positions 266, 267, 268, 269, 270, 271, 295, 297, 298, and 299, according to the EU numbering scheme. Illustrative modified Fc polypeptides are provided in SEQ ID NOS:134-138. In some embodiments, the modified Fc polypeptide comprises Pro at position 270, Glu at position 295, and/or Tyr at position 297. In some embodiments, the modified Fc polypeptide comprises at least one substitution at a position as follows: Pro, Phe, Ala, Met, or Asp at position 266; Gln, Pro, Arg, Lys, Ala, Ile, Leu, Glu, Asp, or Tyr at position 267; Thr, Ser, Gly, Met, Val, Phe, Trp, or Leu at position 268; Pro, Val, Ala, Thr, or Asp at position 269; Pro, Val, or Phe at position 270; Trp, Gln, Thr, or Glu at position 271; Glu, Val, Thr, Leu, or Trp at position 295; Tyr, His, Val, or Asp at position 297; Thr, His, Gln, Arg, Asn, or Val at position 298; or Tyr, Asn, Asp, Ser, or Pro at position 299. In some embodiments, two, three, four, five, six, seven, eight, nine, or all ten of positions 266, 267, 268, 269, 270, 271, 295, 297, 298, and 299 have a substitution as specified in this paragraph. In some embodiments, a modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set.

In some embodiments, the modified Fc polypeptide comprises Pro, Phe, or Ala at position 266; Gln, Pro, Arg, Lys, Ala, or Ile at position 267; Thr, Ser, Gly, Met, Val, Phe, or Trp at position 268; Pro, Val, or Ala at position 269; Pro at position 270; Trp or Gln at position 271; Glu at position 295; Tyr at position 297; Thr, His, or Gln at position 298; and/or Tyr, Asn, Asp, or Ser at position 299.

In some embodiments, the modified Fc polypeptide comprises Met at position 266; Leu or Glu at position 267; Trp at position 268; Pro at position 269; Val at position 270; Thr at position 271; Val or Thr at position 295; His at position 197; His, Arg, or Asn at position 198; and/or Pro at position 299.

In some embodiments, the modified Fc polypeptide comprises Asp at position 266; Asp at position 267; Leu at position 268; Thr at position 269; Phe at position 270; Gln at position 271; Val or Leu at position 295; Val at position 297; Thr at position 298; and/or Pro at position 299.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 1-110 of any one of SEQ ID NOS:134-138. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to SEQ ID NOS:134-138.

In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:134-138. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:134-138, but in which one, two, or three amino acids are substituted.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR comprises at least two, three, four, five, six, seven, eight, nine, or ten substitutions at positions 268, 269, 270, 271, 272, 292, 293, 294, and 300, according to the EU numbering scheme. Illustrative modified Fc polypeptides are provided in SEQ ID NOS:139-143. In some embodiments, the modified Fc polypeptide comprises at least one substitution at a position as follows: Val or Asp at position 268; Pro, Met, or Asp at position 269; Pro or Trp at position 270; Arg, Trp, Glu, or Thr at position 271; Met, Tyr, or Trp at position 272; Leu or Trp at position 292; Thr, Val, Ile, or Lys at position 293; Ser, Lys, Ala, or Leu at position 294; His, Leu, or Pro at position 296; or Val or Trp at position 300. In some embodiments, two, three, four, five, six, seven, eight, nine, or all ten of positions 268, 269, 270, 271, 272, 292, 293, 294, and 300 have a substitution as specified in this paragraph. In some embodiments, the modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set.

In some embodiments, the modified Fc polypeptide comprises Val at position 268; Pro at position 269; Pro at position 270; Arg or Trp at position 271; Met at position 272; Leu at position 292; Thr at position 293; Ser at position 294; His at position 296; and/or Val at position 300.

In some embodiments, the modified Fc polypeptide comprises Asp at position 268; Met or Asp at position 269; Trp at position 270; Glu or Thr at position 271; Tyr or Trp at position 272; Trp at position 292; Val, Ile, or Lys at position 293; Lys, Ala, or Leu at position 294; Leu or Pro at position 296; and/or Trp at position 300.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 1-110 of any one of SEQ ID NOS:139-143. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to SEQ ID NOS:139-143. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:139-143. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:139-143, but in which one, two, or three amino acids are substituted.

In some embodiments, a modified Fc polypeptide that specifically binds to TfR has at least two, three, four, five, six, seven, eight, nine, or ten substitutions at positions 272, 274, 276, 322, 324, 326, 329, 330, and 331, according to the EU numbering scheme. Illustrative modified Fc polypeptides are provided in SEQ ID NOS:144-148. In some embodiments, the modified Fc polypeptide comprises Trp at position 330. In some embodiments, the modified Fc polypeptide comprises at least one substitution at a position as follows: Trp, Val, Ile, or Ala at position 272; Trp or Gly at position 274; Tyr, Arg, or Glu at position 276; Ser, Arg, or Gln at position 322; Val, Ser, or Phe at position 324; Ile, Ser, or Trp at position 326; Trp, Thr, Ser, Arg, or Asp at position 329; Trp at position 330; or Ser, Lys, Arg, or Val at position 331. In some embodiments, two, three, four, five, six, seven, eight, or all nine of positions 272, 274, 276, 322, 324, 326, 329, 330, and 331 have a substitution as specified in this paragraph. In some embodiments, the modified Fc polypeptide may comprise a conservative substitution, e.g., an amino acid in the same charge grouping, hydrophobicity grouping, side chain ring structure grouping (e.g., aromatic amino acids), or size grouping, and/or polar or non-polar grouping, of a specified amino acid at one or more of the positions in the set.

In some embodiments, the modified Fc polypeptide comprises two, three, four, five, six, seven, eight, or nine positions selected from the following: position 272 is Trp, Val, Ile, or Ala; position 274 is Trp or Gly; position 276 is Tyr, Arg, or Glu; position 322 is Ser, Arg, or Gln; position 324 is Val, Ser, or Phe; position 326 is Ile, Ser, or Trp; position 329 is Trp, Thr, Ser, Arg, or Asp; position 330 is Trp; and position 331 is Ser, Lys, Arg, or Val. In some embodiments, the modified Fc polypeptide comprises Val or Ile at position 272; Gly at position 274; Arg at position 276; Arg at position 322; Ser at position 324; Ser at position 326; Thr, Ser, or Arg at position 329; Trp at position 330; and/or Lys or Arg at position 331.

In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to amino acids 1-110 of any one of SEQ ID NOS:144-148. In some embodiments, the modified Fc polypeptide has at least 70% identity, at least 75% identity, at least 80% identity, at least 85% identity, at least 90% identity, or at least 95% identity to SEQ ID NOS:144-148. In some embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:144-148. In other embodiments, the modified Fc polypeptide comprises the amino acid sequence of any one of SEQ ID NOS:144-148, but in which one, two, or three amino acids are substituted.

VI. Additional Fc Polypeptide Mutations

In some aspects, a fusion protein described herein comprises two Fc polypeptides that may each comprise independently selected modifications or may be a wild-type Fc polypeptide, e.g., a human IgG1 Fc polypeptide. In some embodiments, one or both Fc polypeptides contains one or more modifications that confer binding to a blood-brain barrier (BBB) receptor, e.g., transferrin receptor (TfR). Non-limiting examples of other mutations that can be introduced into one or both Fc polypeptides include, e.g., mutations to increase serum stability or serum half-life, to modulate effector function, to influence glycosylation, to reduce immunogenicity in humans, and/or to provide for knob and hole heterodimerization of the Fc polypeptides.

In some embodiments, the Fc polypeptides present in the fusion protein independently have an amino acid sequence identity of at least about 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% to a corresponding wild-type Fc polypeptide (e.g., a human IgG1, IgG2, IgG3, or IgG4 Fc polypeptide).

In some embodiments, the Fc polypeptides present in the fusion protein include knob and hole mutations to promote heterodimer formation and hinder homodimer formation. Generally, the modifications introduce a protuberance (“knob”) at the interface of a first polypeptide and a corresponding cavity (“hole”) in the interface of a second polypeptide, such that the protuberance can be positioned in the cavity so as to promote heterodimer formation and thus hinder homodimer formation. Protuberances are constructed by replacing small amino acid side chains from the interface of the first polypeptide with larger side chains (e.g., tyrosine or tryptophan). Compensatory cavities of identical or similar size to the protuberances are created in the interface of the second polypeptide by replacing large amino acid side chains with smaller ones (e.g., alanine or threonine). In some embodiments, such additional mutations are at a position in the Fc polypeptide that does not have a negative effect on binding of the polypeptide to a BBB receptor, e.g., TfR.

In one illustrative embodiment of a knob and hole approach for dimerization, position 366 (numbered according to the EU numbering scheme) of one of the Fc polypeptides present in the fusion protein comprises a tryptophan in place of a native threonine. The other Fc polypeptide in the dimer has a valine at position 407 (numbered according to the EU numbering scheme) in place of the native tyrosine. The other Fc polypeptide may further comprise a substitution in which the native threonine at position 366 (numbered according to the EU numbering scheme) is substituted with a serine and a native leucine at position 368 (numbered according to the EU numbering scheme) is substituted with an alanine. Thus, one of the Fc polypeptides of a fusion protein described herein has the T366W knob mutation and the other Fc polypeptide has the Y407V mutation, which is typically accompanied by the T366S and L368A hole mutations.

In some embodiments, modifications to enhance serum half-life may be introduced. For example, in some embodiments, one or both Fc polypeptides present in a fusion protein described herein may comprise a tyrosine at position 252, a threonine at position 254, and a glutamic acid at position 256, as numbered according to the EU numbering scheme. Thus, one or both Fc polypeptides may have M252Y, S254T, and T256E substitutions. Alternatively, one or both Fc polypeptides may have M428L and N434S substitutions, as numbered according to the EU numbering scheme. Alternatively, one or both Fc polypeptides may have an N434S or N434A substitution.

In some embodiments, one or both Fc polypeptides present in a fusion protein described herein may comprise modifications that reduce effector function, i.e., having a reduced ability to induce certain biological functions upon binding to an Fc receptor expressed on an effector cell that mediates the effector function. Examples of antibody effector functions include, but are not limited to, C1q binding and complement dependent cytotoxicity (CDC), Fc receptor binding, antibody-dependent cell-mediated cytotoxicity (ADCC), antibody-dependent cell-mediated phagocytosis (ADCP), down-regulation of cell surface receptors (e.g., B cell receptor), and B-cell activation. Effector functions may vary with the antibody class. For example, native human IgG1 and IgG3 antibodies can elicit ADCC and CDC activities upon binding to an appropriate Fc receptor present on an immune system cell; and native human IgG1, IgG2, IgG3, and IgG4 can elicit ADCP functions upon binding to the appropriate Fc receptor present on an immune cell.

In some embodiments, one or both Fc polypeptides present in a fusion protein described herein may also be engineered to contain other modifications for heterodimerization, e.g., electrostatic engineering of contact residues within a CH3-CH3 interface that are naturally charged or hydrophobic patch modifications.

In some embodiments, one or both Fc polypeptides present in a fusion protein described herein may include additional modifications that modulate effector function.

In some embodiments, one or both Fc polypeptides present in a fusion protein described herein may comprise modifications that reduce or eliminate effector function. Illustrative Fc polypeptide mutations that reduce effector function include, but are not limited to, substitutions in a CH2 domain, e.g., at positions 234 and 235, according to the EU numbering scheme. For example, in some embodiments, one or both Fc polypeptides can comprise alanine residues at positions 234 and 235. Thus, one or both Fc polypeptides may have L234A and L235A (LALA) substitutions.

Additional Fc polypeptide mutations that modulate an effector function include, but are not limited to, the following: position 329 may have a mutation in which proline is substituted with a glycine or arginine or an amino acid residue large enough to destroy the Fc/Fcγ receptor interface that is formed between proline 329 of the Fc and tryptophan residues Trp 87 and Trp 110 of FcγRIII Additional illustrative substitutions include S228P, E233P, L235E, N297A, N297D, and P331S, according to the EU numbering scheme. Multiple substitutions may also be present, e.g., L234A and L235A of a human IgG1 Fc region; L234A, L235A, and P329G of a human IgG1 Fc region; S228P and L235E of a human IgG4 Fc region; L234A and G237A of a human IgG1 Fc region; L234A, L235A, and G237A of a human IgG1 Fc region; V234A and G237A of a human IgG2 Fc region; L235A, G237A, and E318A of a human IgG4 Fc region; and S228P and L236E of a human IgG4 Fc region, according to the EU numbering scheme. In some embodiments, one or both Fc polypeptides may have one or more amino acid substitutions that modulate ADCC, e.g., substitutions at positions 298, 333, and/or 334, according to the EU numbering scheme.

Illustrative Fc Polypeptides Comprising Additional Mutations

By way of non-limiting example, one or both Fc polypeptides present in a fusion protein described herein may comprise additional mutations including a knob mutation (e.g., T366W as numbered according to the EU numbering scheme), hole mutations (e.g., T366S, L368A, and Y407V as numbered according to the EU numbering scheme), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered according to the EU numbering scheme), and/or mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme).

In some embodiments, an Fc polypeptide may have a knob mutation (e.g., T366W as numbered according to the EU numbering scheme) and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have a knob mutation.

In some embodiments, an Fc polypeptide may have a knob mutation (e.g., T366W as numbered according to the EU numbering scheme), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have a knob mutation and mutations that modulate effector function.

In some embodiments, an Fc polypeptide may have a knob mutation (e.g., T366W as numbered according to the EU numbering scheme), mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have a knob mutation and mutations that increase serum stability or serum half-life.

In some embodiments, an Fc polypeptide may have a knob mutation (e.g., T366W as numbered according to the EU numbering scheme), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered according to the EU numbering scheme), mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have a knob mutation, mutations that modulate effector function, and mutations that increase serum stability or serum half-life.

In some embodiments, an Fc polypeptide may have hole mutations (e.g., T366S, L368A, and Y407V as numbered according to the EU numbering scheme) and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have hole mutations.

In some embodiments, an Fc polypeptide may have hole mutations (e.g., T366S, L368A, and Y407V as numbered according to the EU numbering scheme), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have hole mutations and mutations that modulate effector function.

In some embodiments, an Fc polypeptide may have hole mutations (e.g., T366S, L368A, and Y407V as numbered according to the EU numbering scheme), mutations that increase serum or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have hole mutations and mutations that increase serum stability or serum half-life.

In some embodiments, an Fc polypeptide may have hole mutations (e.g., T366S, L368A, and Y407V as numbered according to the EU numbering scheme), mutations that modulate effector function (e.g., L234A, L235A, and/or P329G (e.g., L234A and L235A) as numbered according to the EU numbering scheme), mutations that increase serum stability or serum half-life (e.g., (i) M252Y, S254T, and T256E as numbered with reference to EU numbering, or (ii) N434S with or without M428L as numbered according to the EU numbering scheme), and at least 85% identity, at least 90% identity, or at least 95% identity to the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148. In some embodiments, an Fc polypeptide having the sequence of any one of SEQ ID NOS:1, 4-90, and 124-148 may be modified to have hole mutations, mutations that modulate effector function, and mutations that increase serum stability or serum half-life.

VII. Illustrative Fusion Proteins Comprising an ERT Enzyme

In some aspects, a fusion protein described herein comprises a first Fc polypeptide that is linked to an enzyme replacement therapy (ERT) enzyme, an ERT enzyme variant, or a catalytically active fragment thereof; and a second Fc polypeptide that forms an Fc dimer with the first Fc polypeptide. In some embodiments, the first Fc polypeptide and/or the second Fc polypeptide does not include an immunoglobulin heavy and/or light chain variable region sequence or an antigen-binding portion thereof. In some embodiments, the ERT enzyme is IDS, SGSH, ASM, or GBA. In some embodiments, the first Fc polypeptide is a modified Fc polypeptide and/or the second Fc polypeptide is a modified Fc polypeptide. In some embodiments, the second Fc polypeptide is a modified Fc polypeptide. In some embodiments, the modified Fc polypeptide contains one or more modifications that promote its heterodimerization to the other Fc polypeptide. In some embodiments, the modified Fc polypeptide contains one or more modifications that reduce effector function. In some embodiments, the modified Fc polypeptide contains one or more modifications that extend serum half-life. In some embodiments, the modified Fc polypeptide contains one or more modifications that confer binding to a blood-brain barrier (BBB) receptor, e.g., transferrin receptor (TfR).

In other aspects, a fusion protein described herein comprises a first polypeptide chain that comprises a modified Fc polypeptide that specifically binds to a BBB receptor, e.g., TfR, and a second polypeptide chain that comprises an Fc polypeptide which dimerizes with the modified Fc polypeptide to form an Fc dimer. An ERT enzyme may be linked to either the first or the second polypeptide chain. In some embodiments, the ERT enzyme is IDS, SGSH, ASM, or GBA. In some embodiments, the ERT enzyme is linked to the second polypeptide chain. In some embodiments, the protein comprises two ERT enzymes, each linked to one of the polypeptide chains. In some embodiments, the Fc polypeptide may be a BBB receptor-binding polypeptide that specifically binds to the same BBB receptor as the modified Fc polypeptide in the first polypeptide chain. In some embodiments, the Fc polypeptide does not specifically bind to a BBB receptor.

In some embodiments, a fusion protein described herein comprises a first polypeptide chain comprising a modified Fc polypeptide that specifically binds to TfR and a second polypeptide chain that comprises an Fc polypeptide, wherein the modified Fc polypeptide and the Fc polypeptide dimerize to from an Fc dimer. In some embodiments, the ERT enzyme is IDS, SGSH, ASM, or GBA. In some embodiments, the ERT enzyme is linked to the first polypeptide chain. In some embodiments, the ERT enzyme is linked to the second polypeptide chain. In some embodiments, the Fc polypeptide does not specifically bind to a BBB receptor, e.g., TfR.

In some embodiments, a fusion protein described herein comprises a first polypeptide chain that comprises a modified Fc polypeptide that binds to TfR and comprises a T366W (knob) substitution; and a second polypeptide chain that comprises an Fc polypeptide comprising T366S, L368A, and Y407V (hole) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises L234A and L235A (LALA) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises M252Y, S254T, and T256E (YTE) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises L234A and L235A (LALA) substitutions and M252Y, S254T, and T256E (YTE) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide comprises human IgG1 wild-type residues at positions 234, 235, 252, 254, 256, and 366.

In some embodiments, the modified Fc polypeptide comprises the knob, LALA, and YTE mutations as specified for any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221, and has at least 85% identity, at least 90% identity, or at least 95% identity to the respective sequence; or comprises the sequence of any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221. In some embodiments, the Fc polypeptide comprises the hole, LALA, and YTE mutations as specified for any one of SEQ ID NOS:101-104 and has at least 85% identity, at least 90% identity, or at least 95% identity to the respective sequence; or comprises the sequence of any one of SEQ ID NOS:101-104. In some embodiments, the modified Fc polypeptide comprises any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221, and the Fc polypeptide comprises any one of SEQ ID NOS:101-104. In some embodiments, the N-terminus of the modified Fc polypeptide and/or the Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the modified Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:116, 228, and 229, or comprises the sequence of any one of SEQ ID NOS:116, 228, and 229.

In some embodiments, a fusion protein described herein comprises a first polypeptide chain that comprises a modified Fc polypeptide that binds to TfR and comprises T366S, L368A, and Y407V (hole) substitutions; and a second polypeptide chain that comprises an Fc polypeptide comprising a T366W (knob) substitution. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises L234A and L235A (LALA) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises M252Y, S254T, and T256E (YTE) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide further comprises L234A and L235A (LALA) substitutions and M252Y, S254T, and T256E (YTE) substitutions. In some embodiments, the modified Fc polypeptide and/or the Fc polypeptide comprises human IgG1 wild-type residues at positions 234, 235, 252, 254, 256, and 366.

In some embodiments, the modified Fc polypeptide comprises the hole, LALA, and YTE mutations as specified for any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227, and has at least 85% identity, at least 90% identity, or at least 95% identity to the respective sequence; or comprises the sequence of any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227. In some embodiments, the Fc polypeptide comprises the knob, LALA, and YTE mutations as specified for any one of SEQ ID NOS:109-112 and has at least 85% identity, at least 90% identity, or at least 95% identity to the respective sequence; or comprises the sequence of any one of SEQ ID NOS:109-112. In some embodiments, the modified Fc polypeptide comprises any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227, and the Fc polypeptide comprises any one of SEQ ID NOS:109-112. In some embodiments, the N-terminus of the modified Fc polypeptide and/or the Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113).

In some embodiments, an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, present in a fusion protein described herein is linked to a polypeptide chain that comprises an Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:101-104, or comprises the sequence of any one of SEQ ID NOS:101-104 (e.g., as a fusion polypeptide). In some embodiments, the ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, is linked to the Fc polypeptide by a linker, such as a flexible linker, and/or a hinge region or portion thereof (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the ERT enzyme comprises an IDS sequence having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:114, 230, and 234, or comprises the sequence of any one of SEQ ID NOS:114, 230, and 234. In some embodiments, the IDS sequence linked to the Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:115, 117, 231, 232, 235, and 236, or comprises the sequence of any one of SEQ ID NOS:115, 117, 231, 232, 235, and 236. In some embodiments, the ERT enzyme comprises an SGSH sequence having at least 85%, at least 90%, or at least 95% identity to SEQ ID NO:120, or comprises the sequence of SEQ ID NO:120. In some embodiments, the SGSH sequence linked to the Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:149 and 150, or comprises the sequence of any one of SEQ ID NOS:149 and 150. In some embodiments, the fusion protein comprises a modified Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221, or comprises the sequence of any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221. In some embodiments, the N-terminus of the Fc polypeptide and/or the modified Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the modified Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:116, 228, and 229, or comprises the sequence of any one of SEQ ID NOS:116, 228, and 229.

In some embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:115, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:205 and 228. In other embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:115, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:169 and 229.

In some embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:231, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:205 and 228. In other embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:231, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:169 and 229.

In some embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:235, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:205 and 228. In other embodiments, the fusion protein comprises an IDS-Fc fusion polypeptide comprising the sequence of SEQ ID NO:235, and a modified Fc polypeptide comprising the sequence of any one of SEQ ID NOS:169 and 229.

In some embodiments, an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, present in a fusion protein described herein is linked to a polypeptide chain that comprises an Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:109-112, or comprises the sequence of any one of SEQ ID NOS:109-112 (e.g., as a fusion polypeptide). In some embodiments, the ERT enzyme, e.g., IDS SGSH, ASM, or GBA, is linked to the Fc polypeptide by a linker, such as a flexible linker, and/or a hinge region or portion thereof (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the ERT enzyme comprises an IDS sequence having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:114, 230, and 234, or comprises the sequence of any one of SEQ ID NOS:114, 230, and 234. In some embodiments, the IDS sequence linked to the Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:118, 233, and 237, or comprises the sequence of any one of SEQ ID NOS:118, 233, and 237. In some embodiments, the ERT enzyme comprises an SGSH sequence having at least 85%, at least 90%, or at least 95% identity to SEQ ID NO:120, or comprises the sequence of SEQ ID NO:120. In some embodiments, the SGSH sequence linked to the Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:152 and 153, or comprises the sequence of any one of SEQ ID NOS:152 and 153. In some embodiments, the fusion protein comprises a modified Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227, or comprises the sequence of any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227. In some embodiments, the N-terminus of the Fc polypeptide and/or the modified Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113).

In some embodiments, an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, present in a fusion protein described herein is linked to a polypeptide chain that comprises a modified Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221, or comprises the sequence of any one of SEQ ID NOS:97-100, 151, 156-161, 168-173, 180-185, 192-197, 204-209, and 216-221 (e.g., as a fusion polypeptide). In some embodiments, the ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, is linked to the modified Fc polypeptide by a linker, such as a flexible linker, and/or a hinge region or portion thereof (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the ERT enzyme comprises an IDS sequence having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:114, 230, and 234, or comprises the sequence of any one of SEQ ID NOS:114, 230, and 234. In some embodiments, the ERT enzyme comprises an SGSH sequence having at least 85%, at least 90%, or at least 95% identity to SEQ ID NO:120, or comprises the sequence of SEQ ID NO:120. In some embodiments, the SGSH sequence linked to the modified Fc polypeptide has at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:154 and 155, or comprises the sequence of any one of SEQ ID NOS:154 and 155. In some embodiments, the fusion protein comprises an Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:101-104, 149 and 150, or comprises the sequence of any one of SEQ ID NOS:101-104, 149 and 150. In some embodiments, the N-terminus of the modified Fc polypeptide and/or the Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113).

In some embodiments, an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, present in a fusion protein described herein is linked to a polypeptide chain that comprises a modified Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227, or comprises the sequence of any one of SEQ ID NOS:105-108, 162-167, 174-179, 186-191, 198-203, 210-215, and 222-227 (e.g., as a fusion polypeptide). In some embodiments, the ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, is linked to the modified Fc polypeptide by a linker, such as a flexible linker, and/or a hinge region or portion thereof (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the ERT enzyme comprises an IDS sequence having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:114, 230, and 234, or comprises the sequence of any one of SEQ ID NOS:114, 230, and 234. In some embodiments, the ERT enzyme comprises an SGSH sequence having at least 85%, at least 90%, or at least 95% identity to SEQ ID NO:120, or comprises the sequence of SEQ ID NO:120. In some embodiments, the fusion protein comprises an Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:109-112, or comprises the sequence of any one of SEQ ID NOS:109-112. In some embodiments, the fusion protein comprises an SGSH sequence linked to an Fc polypeptide having at least 85%, at least 90%, or at least 95% identity to any one of SEQ ID NOS:152 and 153, or comprising the sequence of any one of SEQ ID NOS:152 and 153. In some embodiments, the N-terminus of the modified Fc polypeptide and/or the Fc polypeptide includes a portion of an IgG1 hinge region (e.g., DKTHTCPPCP; SEQ ID NO:113).

VIII. Measuring Binding Kinetics, Affinity, Brain Concentration, and Brain Exposure

Fusion proteins and other compositions described herein may have a broad range of binding affinities. For example, in some embodiments, a protein has an affinity for a blood-brain barrier (BBB) receptor, e.g., transferrin receptor (TfR), ranging anywhere from 1 pM to 10 μM. In some embodiments, the affinity for TfR ranges from 1 nM to 5 μM, or from 10 nM to 1 μM. In some embodiments, the affinity for TfR ranges from about 50 nM to about 250 nM.

In some embodiments, the affinity of a TfR-binding polypeptide may be measured in a monovalent format. In other embodiments, affinity may be measured in a bivalent format, e.g., as a dimer comprising a polypeptide-Fab fusion protein.

Methods for analyzing binding affinity, binding kinetics, and cross-reactivity to analyze binding to a BBB receptor, e.g., TfR, are known in the art. These methods include, but are not limited to, solid-phase binding assays (e.g., ELISA assay), immunoprecipitation, surface plasmon resonance (e.g., Biacore™ (GE Healthcare, Piscataway, N.J.)), kinetic exclusion assays (e.g., KinExA®), flow cytometry, fluorescence-activated cell sorting (FACS), BioLayer interferometry (e.g., Octet® (FortéBio, Inc., Menlo Park, Calif.)), and Western blot analysis. In some embodiments, ELISA is used to determine binding affinity and/or cross-reactivity. Methods for performing ELISA assays are known in the art and are also described in the Examples section below. In some embodiments, surface plasmon resonance (SPR) is used to determine binding affinity, binding kinetics, and/or cross-reactivity. In some embodiments, kinetic exclusion assays are used to determine binding affinity, binding kinetics, and/or cross-reactivity. In some embodiments, BioLayer interferometry assays are used to determine binding affinity, binding kinetics, and/or cross-reactivity.

A non-limiting example of a method for determining binding affinity (e.g., for TfR) is described in Example 13 below, in which a Biacore™ instrument was used to determine affinity by surface plasmon resonance (SPR). In this method, an engineered TfR-binding polypeptide, a TfR-binding peptide, or a TfR-binding antibody of interest is captured on a sensor chip and serial dilutions of TfR are injected onto the sensor chip at a specified flow rate (e.g., 30 μL/min) and temperature (e.g., room temperature). Samples are analyzed using specified association and dissociation times (e.g., 45 and 180 seconds, respectively), followed by sensor chip regeneration. Binding responses are corrected by subtracting the measured response from a control (e.g., using an irrelevant IgG at similar density) and then steady-state affinities can be determined by using software to fit the equilibrium response against concentration.

The concentration of an engineered TfR-binding polypeptide, a TfR-binding peptide, a TfR-binding antibody, or an agent (e.g., linked to the engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody) in the brain and/or plasma can be measured, for example, using a human transferrin receptor (hTfR) knock-in mouse model. Such a model can be used, for example, to measure and/or compare maximum brain concentration (C_(max)) and/or brain exposure, e.g., to determine whether C_(max) is increased and/or brain exposure is prolonged. The creation of a human apical TfR (TfR^(ms/hu)) mouse knock-in model is described below in Example 12. To create a suitable model, a CRISPR/Cas9 system can be used to generate a mouse that expresses a human Tfrc apical domain within a murine Tfrc gene (e.g., in which in vivo expression is under the control of an endogenous promoter). In particular, Cas9, single guide RNAs and donor DNA (e.g., a human apical domain coding sequence that has been codon optimized for expression in mouse) can be introduced into mouse embryos (e.g., by pronuclear injection). The embryos can then be transferred to pseudo pregnant females. A founder male from the progeny of the female that received the embryos can be bred to wild-type females to generate F1 heterozygous mice. Homozygous mice can then be subsequently generated from breeding of F1 generation heterozygous mice.

For evaluation of brain and/or plasma concentration or exposure of the engineered TfR-binding polypeptide, TfR-binding peptide, TfR-binding antibody, or agent (e.g., linked to the engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody), the engineered TfR-binding polypeptide, TfR-binding peptide, TfR-binding antibody (e.g., linked to the agent) can be administered to the mouse model (e.g., TfR^(ms/hu)). Plasma samples can be obtained from the mouse after a suitable period of time, followed by perfusion of the vascular system with a suitable solution. Following perfusion, brains (or portions thereof) can be extracted and homogenized and lysed. Concentrations of the agent in the plasma and/or brain lysate can then be determined using standard methods that will be known to one of ordinary skill in the art. As a non-limiting example, an ELISA-based assay such as one described in Example 3 below can be used to measure concentrations. Briefly, the concentration of an agent, engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody (e.g., in plasma or lysate) can be quantified using a sandwich ELISA. A capture antibody (e.g., an anti-Fc capture antibody) can be coated onto a plate (e.g., a 384-well MaxiSorp™ plate) at a desired concentration (e.g., about 3 μg/mL). The plate is blocked (e.g., with 5% BSA) and then incubated with plasma that has been diluted (e.g., 1:1,000 or 1:10,000). Next, a detection antibody is added at a desired concentration (e.g., about 0.5 μg/mL) followed by a secondary antibody such as an anti-goat-HRP antibody. The plates are then developed (e.g., using TMB substrate), stopped (e.g., with sulfuric acid), and the absorbance at an appropriate wavelength (e.g., 450 nm) measured on a plate reader (e.g., a BioTek plate reader). Standard curves can be generated using an appropriate (e.g., 4-fold) dilution series and fit using an algorithm such as a four-parameter logistic regression.

By administering a range of doses to the knock-in mouse model, a standard curve can be generated. By administering to the knock-in mouse model an agent linked to different engineered TfR-binding polypeptides, TfR-binding peptides, or TfR-binding antibodies (e.g., having different TfR affinities), or an agent linked to a reference polypeptide or protein (e.g., that has a weaker affinity for TfR than the polypeptide or protein of interest), comparisons can be made regarding the effects of the engineered TfR-binding polypeptides, TfR-binding peptides, or TfR-binding antibodies on brain exposure to the agent and/or C_(max) values of the agent in the brain.

IX. ERT Enzymes Linked to Fc Polypeptides

In some embodiments, a fusion protein described herein comprises two Fc polypeptides as described herein and one or both of the Fc polypeptides may further comprise a partial or full hinge region. The hinge region can be from any immunoglobulin subclass or isotype. An illustrative immunoglobulin hinge is an IgG hinge region, such as an IgG1 hinge region, e.g., human IgG1 hinge amino acid sequence EPKSCDKTHTCPPCP (SEQ ID NO:95) or a portion thereof (e.g., DKTHTCPPCP; SEQ ID NO:113). In some embodiments, the hinge region is at the N-terminal region of the Fc polypeptide.

In some embodiments, an Fc polypeptide is joined to the ERT enzyme by a linker, e.g., a peptide linker. In some embodiments, the Fc polypeptide is joined to the ERT enzyme by a peptide bond or by a peptide linker, e.g., is a fusion polypeptide. The peptide linker may be configured such that it allows for the rotation of the ERT enzyme relative to the Fc polypeptide to which it is joined; and/or is resistant to digestion by proteases. Peptide linkers may contain natural amino acids, unnatural amino acids, or a combination thereof. In some embodiments, the peptide linker may be a flexible linker, e.g., containing amino acids such as Gly, Asn, Ser, Thr, Ala, and the like. Such linkers are designed using known parameters and may be of any length and contain any number of repeat units of any length (e.g., repeat units of Gly and Ser residues). For example, the linker may have repeats, such as two, three, four, five, or more Gly4-Ser (SEQ ID NO:239) repeats or a single Gly4-Ser (SEQ ID NO:239). In some embodiments, the peptide linker may include a protease cleavage site, e.g., that is cleavable by an enzyme present in the central nervous system.

In some embodiments, the ERT enzyme is joined to the N-terminus of the Fc polypeptide, e.g., by a Gly4-Ser linker (SEQ ID NO:239) or a (Gly4-Ser)₂ linker (SEQ ID NO:240). In some embodiments, the Fc polypeptide may comprise a hinge sequence or partial hinge sequence at the N-terminus that is joined to the linker or directly joined to the ERT enzyme.

In some embodiments, the ERT enzyme is joined to the C-terminus of the Fc polypeptide, e.g., by a Gly4-Ser linker (SEQ ID NO:239) or a (Gly4-Ser)₂ linker (SEQ ID NO:240). In some embodiments, the C-terminus of the Fc polypeptide is directly joined to the ERT enzyme.

In some embodiments, the ERT enzyme is joined to the Fc polypeptide by a chemical cross-linking agent. Such conjugates can be generated using well-known chemical cross-linking reagents and protocols. For example, there are a large number of chemical cross-linking agents that are known to those skilled in the art and useful for cross-linking the polypeptide with an agent of interest. For example, the cross-linking agents are heterobifunctional cross-linkers, which can be used to link molecules in a stepwise manner. Heterobifunctional cross-linkers provide the ability to design more specific coupling methods for conjugating proteins, thereby reducing the occurrences of unwanted side reactions such as homo-protein polymers. A wide variety of heterobifunctional cross-linkers are known in the art, including N-hydroxysuccinimide (NHS) or its water soluble analog N-hydroxysulfosuccinimide (sulfo-NHS), succinimidyl 4-(N-maleimidomethyl)cyclohexane-1-carboxylate (SMCC), m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS); N-succinimidyl (4-iodoacetyl) aminobenzoate (SIAB), succinimidyl 4-(p-maleimidophenyl)butyrate (SMPB), 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDC); 4-succinimidyloxycarbonyl-a-methyl-a-(2-pyridyldithio)-toluene (SMPT), N-succinimidyl 3-(2-pyridyldithio)propionate (SPDP), and succinimidyl 6-[3-(2-pyridyldithio)propionate]hexanoate (LC-SPDP). Those cross-linking agents having N-hydroxysuccinimide moieties can be obtained as the N-hydroxysulfosuccinimide analogs, which generally have greater water solubility. In addition, those cross-linking agents having disulfide bridges within the linking chain can be synthesized instead as the alkyl derivatives so as to reduce the amount of linker cleavage in vivo. In addition to the heterobifunctional cross-linkers, there exist a number of other cross-linking agents including homobifunctional and photoreactive cross-linkers. Disuccinimidyl subcrate (DSS), bismaleimidohexane (BMH) and dimethylpimelimidate. 2HCl (DMP) are examples of useful homobifunctional cross-linking agents, and bis[B-(4-azidosalicylamido)ethyl]disulfide (BASED) and N-succinimidyl-6(4′-azido-2′-nitrophenylamino)hexanoate (SANPAH) are examples of useful photoreactive cross-linkers.

X. Evaluation of Protein Activity

Activity of fusion proteins described herein that comprise an ERT enzyme such as, e.g., IDS, SGSH, ASM, or GBA, can be assessed using various assays, including assays that measure activity in vitro using an artificial substrate, such as those described in the Examples section. An illustrative protocol for measuring IDS activity in vitro is provided in Example 2. An illustrative protocol for measuring ASM activity in vitro is provided in Example 5. Illustrative protocols for measuring SGSH activity in vitro are provided in Examples 7 and 8.

In some aspects, IDS activity is assessed by assaying a sample, such as a cell sample, tissue sample, or fluid sample (e.g., CSF or urine), for the amount of the glycosaminoglycans (GAGs) heparan and dermatan sulfate, which accumulate as a result of IDS deficiency. The amount of heparan and dermatan sulfate is determined by digesting GAGs present in a sample with heparinase and chondroitinase. The resulting disaccharides can then be assayed by mass spectrometry (e.g., LC-MS/MS). Samples with high levels of heparan and dermatan sulfate accumulation will have increased amounts of heparan and dermatan sulfate-derived disaccharides. Thus, the level of disaccharides is inversely proportional to IDS enzymatic activity.

The mass spectrometry (e.g., LC-MS/MS) assay can be performed on any sample in which GAGs accumulate, including cell samples, tissue samples, and fluid samples. Such samples can be evaluated to monitor the activity of an IDS-containing protein described herein, e.g., that is administered to cells in vitro, or in some embodiments, administered to a subject in vivo. The subject may be an animal, such as a rodent, e.g., a mouse, or a non-human primate. In some embodiments, the subject is a human patient, such as a patient having Hunter syndrome that is undergoing treatment with an IDS therapy, wherein the assay is used to monitor IDS activity in the patient. In some embodiments, the human patient is undergoing treatment with a fusion protein described herein.

For cellular samples, such as cells or tissue samples, the assay comprises disrupting the cells and breaking open microvesicles. Disruption of cells or breaking open microvesicles may be achieved by using freeze-thawing and/or sonication to obtain an extract (e.g., a cellular extract) comprising GAGs. The GAGs are then subject to treatment with heparinases (e.g., any described herein) and chondroitinase, which break down heparan sulfate and dermatan sulfate GAGs. Following digestion, a supernatant containing the GAG disaccharides is obtained and the disaccharide products are analyzed by mass spectrometry (e.g., LC-MS/MS). An illustrative protocol is provided in Example 2.

In some embodiments, a cell sample to be assayed for IDS activity is washed and frozen. Cell pellets are sonicated in a disaccharide digestion buffer. A desired amount of total protein from the sonicated sample is then added to digestion buffer comprising heparinase I, heparinase II, heparinase III, and chondroitinase B, the latter of which is specific for dermatan sulfate. Following digestion, e.g., about 3 hours at 30° C., the enzymes are deactivated with EDTA and boiling. Sample are then centrifuged, e.g., at 16,000×G, and the supernatant transferred to a centrifugal filter and centrifuged at about 14,000×G. Disaccharides are then resuspended in a mixture of assay buffer: acetonitrile at a 1:1 v/v ratio and further analyzed by liquid chromatography coupled to electrospray mass spectrometry, e.g., as described in Example 2. GAG-derived disaccharide products can be identified based on a retention time compared to those of commercially available reference standards. Illustrative heparan sulfate-derived disaccharides include DOSO and D250 (nomenclature according to Lawrence et al., Nat. Methods, 5:291-292 (2008)).

In other aspects, SGSH activity is assessed by assaying a sample, such as a cell sample or tissue sample, for the amount of heparan sulfate glycosaminoglycans (GAGs), which accumulate as a result of SGSH deficiency. The amount of heparan sulfate is determined by digesting GAGs present in a sample with heparinase (e.g., any described herein). The resulting disaccharides can then be assayed by mass spectrometry (e.g., LC-MS/MS). Samples with high levels of heparan sulfate accumulation will have increased amounts of heparan sulfate-derived disaccharides. Thus, the level of disaccharides is inversely proportional to SGSH enzymatic activity.

The mass spectrometry (e.g., LC-MS/MS) assay can be performed on any sample in which GAGs accumulate, including cell samples, tissue samples, and fluid samples. Such samples can be evaluated to monitor the activity of an SGSH-containing protein described herein, e.g., that is administered to cells in vitro, or in some embodiments, administered to a subject in vivo. The subject may be an animal, such as a rodent, e.g., a mouse, or a non-human primate. In some embodiments, the subject is a human patient, such as a patient having Sanfilippo syndrome A that is undergoing treatment with an SGSH therapy, wherein the assay is used to monitor SGSH activity in the patient. In some embodiments, the human patient is undergoing treatment with a fusion protein described herein.

For cellular samples, such as cells or tissue samples, the assay comprises disrupting the cells and/or breaking open microvesicles. Disruption of cells or breaking open microvesicles may be achieved by using freeze-thawing and/or sonication to obtain an extract (e.g., a cellular extract) comprising GAGs. The GAGs are then subject to treatment with heparinases (e.g., any described herein), which break down heparan sulfate GAGs. Following digestion, a supernatant containing the GAG disaccharides is obtained and the disaccharide products are analyzed by mass spectrometry (e.g., LC-MS/MS). An illustrative protocol is provided in Example 7.

In some embodiments, a cell sample to be assayed for SGSH activity is washed and frozen. Cell pellets are sonicated in a disaccharide digestion buffer. A desired amount of total protein from the sonicated sample is then added to digestion buffer comprising heparinase I, heparinase II, and/or heparinase III. Following digestion, e.g., about 3 hours at 30° C., the enzymes are deactivated with EDTA and boiling. Sample are then centrifuged, e.g., at 16,000×G, and the supernatant transferred to a centrifugal filter and centrifuged at about 14,000×G. Disaccharides are then resuspended in a mixture of assay buffer:acetonitrile at a 1:1 v/v ratio and further analyzed by liquid chromatography coupled to electrospray mass spectrometry, e.g., as described in Example 7. GAG-derived disaccharide products can be identified based on a retention time compared to those of commercially available reference standards. Illustrative heparan sulfate-derived disaccharides include D0S0 and D2S0 (nomenclature according to Lawrence et al., Nat. Methods, 5:291-292 (2008)).

In some embodiments, a tissue sample is evaluated. A tissue sample can be evaluated using an assay as described above, except multiple free-thaw cycles, e.g., 2, 3, 4, 5, or more, are typically included before the sonication step to ensure that microvesicles are broken open.

Samples that can be evaluated by the assays described herein include brain, liver, kidney, lung, spleen, plasma, serum, cerebrospinal fluid (CSF), and urine. In some embodiments, CSF samples from a patient receiving an enzyme-Fc fusion protein (e.g., an IDS-Fc or SGSH-Fc fusion protein) described herein may be evaluated.

XI. Nucleic Acids, Vectors, and Host Cells

Polypeptide chains contained in the fusion proteins as described herein are typically prepared using recombinant methods. Accordingly, in some aspects, the present disclosure provides isolated nucleic acids comprising a nucleic acid sequence encoding any of the polypeptide chains comprising Fc polypeptides as described herein, and host cells into which the nucleic acids are introduced that are used to replicate the polypeptide-encoding nucleic acids and/or to express the polypeptides. In some embodiments, the host cell is eukaryotic, e.g., a human cell.

In another aspect, polynucleotides are provided that comprise a nucleotide sequence that encodes the polypeptide chains described herein. The polynucleotides may be single-stranded or double-stranded. In some embodiments, the polynucleotide is DNA. In particular embodiments, the polynucleotide is cDNA. In some embodiments, the polynucleotide is RNA.

In some embodiments, the polynucleotide is included within a nucleic acid construct. In some embodiments, the construct is a replicable vector. In some embodiments, the vector is selected from a plasmid, a viral vector, a phagemid, a yeast chromosomal vector, and a non-episomal mammalian vector.

In some embodiments, the polynucleotide is operably linked to one or more regulatory nucleotide sequences in an expression construct. In one series of embodiments, the nucleic acid expression constructs are adapted for use as a surface expression library. In some embodiments, the library is adapted for surface expression in yeast. In some embodiments, the library is adapted for surface expression in phage. In another series of embodiments, the nucleic acid expression constructs are adapted for expression of the polypeptide in a system that permits isolation of the polypeptide in milligram or gram quantities. In some embodiments, the system is a mammalian cell expression system. In some embodiments, the system is a yeast cell expression system.

Expression vehicles for production of a recombinant polypeptide include plasmids and other vectors. For instance, suitable vectors include plasmids of the following types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids, and pUC-derived plasmids for expression in prokaryotic cells, such as E. coli. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo, and pHyg-derived vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived, and p205) can be used for transient expression of polypeptides in eukaryotic cells. In some embodiments, it may be desirable to express the recombinant polypeptide by the use of a baculovirus expression system. Examples of such baculovirus expression systems include pVL-derived vectors (such as pVL1392, pVL1393, and pVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derived vectors. Additional expression systems include adenoviral, adeno-associated virus, and other viral expression systems.

Vectors may be transformed into any suitable host cell. In some embodiments, the host cells, e.g., bacteria or yeast cells, may be adapted for use as a surface expression library. In some cells, the vectors are expressed in host cells to express relatively large quantities of the polypeptide. Such host cells include mammalian cells, yeast cells, insect cells, and prokaryotic cells. In some embodiments, the cells are mammalian cells, such as Chinese Hamster Ovary (CHO) cell, baby hamster kidney (BHK) cell, NS0 cell, Y0 cell, HEK293 cell, COS cell, Vero cell, or HeLa cell.

A host cell transfected with an expression vector encoding one or more Fc polypeptide chains as described herein can be cultured under appropriate conditions to allow expression of the one or more polypeptides to occur. The polypeptides may be secreted and isolated from a mixture of cells and medium containing the polypeptides. Alternatively, the polypeptides may be retained in the cytoplasm or in a membrane fraction and the cells harvested, lysed, and the polypeptide isolated using a desired method.

XII. Therapeutic Methods

A fusion protein or an agent (e.g., a therapeutic agent) linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody described herein may be used therapeutically to treat an LSD. In some embodiments, a patient having Hunter syndrome is treated with a fusion protein or an agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody that comprises IDS. In some embodiments, a patient having Sanfilippo syndrome A is treated with a fusion protein or an agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody that comprises SGSH. In some embodiments, a patient having Niemann-Pick disease is treated with a fusion protein or an agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody that comprises ASM. In some embodiments, a patient having Gaucher disease or Parkinson's disease is treated with a fusion protein or an agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody that comprises GBA.

A fusion protein described herein that comprises an ERT enzyme, e.g., IDS, SGSH, ASM, or GBA, is administered to a subject at a therapeutically effective amount or dose. Illustrative dosages include a daily dose range of about 0.01 mg/kg to about 500 mg/kg, or about 0.1 mg/kg to about 200 mg/kg, or about 1 mg/kg to about 100 mg/kg, or about 10 mg/kg to about 50 mg/kg, can be used. In some embodiments, the protein has an enzymatic activity of at least about 500 units (U)/mg, about 1,000 U/mg, or at least about 1,500, 2,000, 2,500, 3,000, 3,500, 4,000, 4,500, 5,000, 6,000, 7,000, 8,000, 9,000, or 10,000 U/mg. In some embodiments, the enzymatic activity is at least about 11,000 U/mg, or at least about 12,000, 13,000, 14,000, 15,000, 16,000, 17,000, 18,000, 19,000, 20,000, 25,000, 30,000, 35,000, 40,000, 45000, or 50,000 U/mg; or anywhere in a range of about 500 U/mg to about 50,000 U/mg. The dosages, however, may be varied according to several factors, including the chosen route of administration, the formulation of the composition, patient response, the severity of the condition, the subject's weight, and the judgment of the prescribing physician. The dosage can be increased or decreased over time, as required by an individual patient. In some embodiments, a patient initially is given a low dose, which is then increased to an efficacious dosage tolerable to the patient. Determination of an effective amount is well within the capability of those skilled in the art.

In various embodiments, a fusion protein described herein is administered parenterally. In some embodiments, the protein is administered intravenously. Intravenous administration can be by infusion, e.g., over a period of from about 10 to about 30 minutes, or over a period of at least 1 hour, 2 hours, or 3 hours. In some embodiments, the protein is administered as an intravenous bolus. Combinations of infusion and bolus administration may also be used.

In some parenteral embodiments, a fusion protein or agent (e.g., therapeutic agent) linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody is administered intraperitoneally, subcutaneously, intradermally, or intramuscularly. In some embodiments, the protein or agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody is administered intradermally or intramuscularly. In some embodiments, the protein or agent linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody is administered intrathecally, such as by epidural administration, or intracerebroventricularly.

In other embodiments, a fusion protein or an agent (e.g., therapeutic agent) linked to an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody may be administered orally, by pulmonary administration, intranasal administration, intraocular administration, or by topical administration. Pulmonary administration can also be employed, e.g., by use of an inhaler or nebulizer, and formulation with an aerosolizing agent.

XIII. Methods for Protein Replacement

In other aspects, provided herein is a method for transporting an agent (e.g., an agent that is useful for treating a lysosomal storage disorder (LSD)) across the blood-brain barrier (BBB) of a mammal. In some embodiments, the method comprises exposing the BBB to a polypeptide or protein that binds (e.g., specifically binds) to a transferrin receptor (TfR) with an affinity of from about 50 nM to about 250 nM. In some embodiments, the polypeptide or protein is linked to the agent and transports the linked agent across the BBB. In some embodiments, the maximum concentration (C_(max)) of the agent in the brain of the mammal is improved (e.g., increased).

In other aspects, provided herein is a method for treating an LSD. In some embodiments, the method comprises administering to a mammal a polypeptide or protein that binds (e.g., specifically binds) to a TfR with an affinity of from about 50 nM to about 250 nM. In some embodiments, the polypeptide or protein is linked to an agent for treating the LSD, thereby exposing the brain of the mammal to the agent.

In some embodiments, the polypeptide or protein binds (e.g., specifically binds) to a TfR with an affinity of about 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, or 250 nM. In some embodiments, the polypeptide or protein binds to a TfR with an affinity of from about 100 nM to about 200 nM or from about 110 nM to about 150 nM.

In some embodiments, the polypeptide or protein (e.g., linked to the agent) improves (e.g., increases) C_(max) of the agent in the brain as compared to the agent linked to a reference polypeptide or protein that binds (e.g., specifically binds) to a TfR with a weaker affinity.

In some embodiments, C_(max) of the agent in the brain is improved (e.g., increased) by at least about 1.1-fold, 1.2-fold, 1.3-fold, 1.4-fold, 1.5-fold, 1.6-fold, 1.7-fold, 1.8-fold, 1.9-fold, 2-fold, 2.2-fold, 2.4-fold, 2.6-fold, 2.8-fold, 3-fold, 4-fold, 5-fold, or more, as compared to the agent that is linked to a reference polypeptide or protein (e.g., that binds to a TfR with a weaker affinity).

In some embodiments, the brain of the mammal is exposed to the agent at a therapeutically effective concentration (e.g., a concentration that is sufficient to treat one or more signs or symptoms of an LSD) for a shorter duration as compared to the agent that is linked to the reference polypeptide or protein. In some embodiments, brain exposure duration is shortened by at least about 5%, 10%, 25%, 40%, 50%, 60%, 75%, 85%, 90%, 95%, or 98%.

In some embodiments, brain exposure is quantified by plotting brain exposure (e.g., concentration of the agent in the brain) as a function of time and calculating the area under the curve (AUC). Decreased AUC can represent decreased or shortened brain exposure. In some embodiments, duration of brain exposure to the agent (e.g., at a therapeutically effective concentration) is shortened.

In some embodiments, the reference polypeptide or protein binds (e.g., specifically binds) to the TfR with an affinity that is, or is weaker than, about 250 nM, 300 nM, 350 nM, 400 nM, 450 nM, 500 nM, 550 nM, or 600 nM. In some embodiments, the reference polypeptide or protein binds to the TfR with an affinity that is, or is weaker than, about 600 nM.

In some embodiments, the mammal is a primate (e.g., a human). In some embodiments, the human is a patient in need of treatment for an LSD. In some embodiments, the patient has one or more signs or symptoms of an LSD.

In some embodiments, the polypeptide or protein binds (e.g., specifically binds) to a primate TfR. In some embodiments, the primate TfR is a human TfR. In some embodiments, the polypeptide or protein binds to a TfR apical domain.

In some embodiments, the agent (e.g., therapeutic agent) is linked to an engineered TfR-binding polypeptide. In some embodiments, the engineered TfR-binding polypeptide comprises CH3 or CH2 domains that have modifications that allow the polypeptide to specifically bind to TfR. Non-limiting examples of suitable engineered TfR-binding polypeptides are described herein. In some embodiments, the agent is linked to an engineered TfR-binding polypeptide that is described in Table 4 or Table 5. In some embodiments, the agent is linked to an engineered TfR-binding polypeptide selected from the group consisting of CH3C.35.20.2, CH3C.35.23.2, CH3C.35.23.5, CH3C.35.21.17, and CH3C.35.21.17.2.

In some embodiments, the agent (e.g., therapeutic agent) is linked to a TfR-binding peptide. In some embodiments, the TfR-binding peptide is a short peptide, being about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acids in length. Methods for generating, screening, and identifying suitable peptides (i.e., that bind to a TfR with an affinity within the desired range) are known in the art. For example, a phage display strategy in which alternating rounds of negative and positive selection are employed can be used to identify suitable peptides. This strategy is described, e.g., in Lee et al., Eur. J. Biochem., 268:2004-2012 (2001), which is hereby incorporated in its entirety for all purposes.

In some embodiments, the agent (e.g., therapeutic agent) is linked to a TfR-binding antibody. Non-limiting examples of suitable TfR-binding antibodies include OX26 anti-TfR antibodies (i.e., having affinities of about 76 nM, 108 nM, and 174 nM) that are disclosed in Thom et al., Mol. Pharm., 15(4):1420-1431 (2018). In some embodiments, the agent is linked to a protein that comprises an antibody variable region that specifically binds to TfR. In some instances, the protein comprises a Fab or an scFv.

In some embodiments, the agent (e.g., therapeutic agent) is a protein (e.g., an enzyme). In some embodiments, the agent is a protein replacement therapeutic. In some embodiments, the agent is a protein or enzyme that is deficient (e.g., underexpressed or absent) in a cell or tissue (e.g., a neural cell or tissue) in the mammal. In some embodiments, the agent is a protein or enzyme that is endogenous to, or expressed in, a normal healthy cell or tissue (e.g., a neural cell or tissue) in the mammal, but is deficient (e.g., in the corresponding cell or tissue) in the mammal that is being treated for the LSD.

In some embodiments, the protein replacement therapeutic is an enzyme. Any number of agents (e.g., protein replacement therapeutics such as enzymes) can be linked to polypeptides or proteins (e.g., that bind to TfR) for the treatment of various LSDs. In some embodiments, the agent is an enzyme that decreases the accumulation of a toxic metabolic product in the brain of the mammal having the LSD to a greater extent when linked to the polypeptide or protein, as compared to when the enzyme is linked to the reference polypeptide or protein. In some embodiments, the enzyme is iduronate 2-sulfatase (IDS) and the LSD is Hunter syndrome. In some instances, the toxic metabolic product comprises heparin sulfate-derived disaccharides and/or dermatan sulfate-derived disaccharides. In some embodiments, the enzyme is N-sulfoglucosamine sulfohydrolase (SGSH) and the LSD is Sanfilippo syndrome. In some embodiments, the enzyme is acid sphingomyelinase (ASM) and the LSD is Niemann-Pick disease. In some embodiments, the enzyme is β-glucocerebrosidase (GBA) and the LSd Gaucher's disease.

In some embodiments, the agent (e.g., therapeutic agent) comprises an antibody variable region. In some embodiments, the agent comprises an antibody fragment. In some embodiments, the agent comprises a Fab or an scFv. In some embodiments, the agent does not comprise an antibody variable region. In some instances, the agent does not comprise an anti-beta secretase 1 (BACE1) Fab.

Additional Embodiments and Linkers

A polypeptide (e.g., a modified CH3 or CH2 domain polypeptide as described further herein) may be joined to another domain of an Fc region. In some embodiments, a modified CH3 domain polypeptide is joined to a CH2 domain, which may be a naturally occurring CH2 domain or a variant CH2 domain, typically at the C-terminal end of the CH2 domain. In some embodiments, a modified CH2 domain polypeptide is joined to a CH3 domain, which may be a naturally occurring CH3 domain or a CH3 variant domain, typically at the N-terminal end of the CH3 domain. In some embodiments, the polypeptide comprising a modified CH2 domain joined to a CH3 domain, or the polypeptide comprising the modified CH3 domain joined to a CH2 domain, further comprises a partial or full hinge region of an antibody, thus resulting in a format in which the modified CH3 domain polypeptide or modified CH2 domain polypeptide is part of an Fc region having a partial or full hinge region. The hinge region can be from any immunoglobulin subclass or isotype. An illustrative immunoglobulin hinge is an IgG hinge region, such as an IgG1 hinge region, e.g., human IgG1 hinge amino acid sequence EPKSCDKTHTCPPCP (SEQ ID NO:95).

In some embodiments, an engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody is fused to a peptide or protein that is useful for protein purification, e.g., polyhistidine, epitope tags, e.g., FLAG, c-Myc, hemagglutinin tags and the like, glutathione S transferase (GST), thioredoxin, protein A, protein G, or maltose binding protein (MBP). In some cases, the peptide or protein to which the engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody is fused may comprise a protease cleavage site, such as a cleavage site for Factor Xa or Thrombin.

In methods of the present disclosure, an agent (e.g., therapeutic agent) is linked to a polypeptide or protein (e.g., an engineered TfR-binding polypeptide, a TfR-binding peptide, or a TfR-binding antibody). The linker may be any linker suitable for joining an agent to the polypeptide or protein. In some embodiments, the linkage is enzymatically cleavable. In certain embodiments, the linkage is cleavable by an enzyme present in the central nervous system.

In some embodiments, the linker is a peptide linker. The peptide linker may be configured such that it allows for the rotation of the agent (e.g., therapeutic agent) and the polypeptide or protein relative to each other; and/or is resistant to digestion by proteases. In some embodiments, the linker may be a flexible linker, e.g., containing amino acids such as Gly, Asn, Ser, Thr, Ala, and the like. Such linkers are designed using known parameters. For example, the linker may have repeats, such as Gly-Ser repeats.

In various embodiments, linking of the agent (e.g., therapeutic agent) to the polypeptide or protein (e.g., engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody) can be achieved using well-known chemical cross-linking reagents and protocols. For example, there are a large number of chemical cross-linking agents that are known to those skilled in the art and useful for cross-linking the polypeptide or protein with an agent of interest. For example, the cross-linking agents are heterobifunctional cross-linkers, which can be used to link molecules in a stepwise manner. Heterobifunctional cross-linkers provide the ability to design more specific coupling methods for conjugating proteins, thereby reducing the occurrences of unwanted side reactions such as homo-protein polymers.

The agent (e.g., therapeutic agent) may be linked to the N-terminal or C-terminal region of the polypeptide or protein, or attached to any region of the polypeptide or protein (e.g., engineered TfR-binding polypeptide, TfR-binding peptide, or TfR-binding antibody), so long as the agent does not interfere with binding of the polypeptide or protein to a transferrin receptor.

XIV. Pharmaceutical Compositions and Kits

In other aspects, pharmaceutical compositions and kits comprising a fusion protein described herein are provided.

Pharmaceutical Compositions

Guidance for preparing formulations for use in the present disclosure can be found in any number of handbooks for pharmaceutical preparation and formulation that are known to those of skill in the art.

In some embodiments, a pharmaceutical composition comprises a fusion protein as described herein and further comprises one or more pharmaceutically acceptable carriers and/or excipients. A pharmaceutically acceptable carrier includes any solvents, dispersion media, or coatings that are physiologically compatible and that do not interfere with or otherwise inhibit the activity of the active agent.

In some embodiments, the carrier is suitable for intravenous, intrathecal, ocular, intracerebroventricular, intramuscular, oral, intraperitoneal, transdermal, topical, or subcutaneous administration. Pharmaceutically acceptable carriers can contain one or more physiologically acceptable compounds that act, for example, to stabilize the composition or to increase or decrease the absorption of the polypeptide. Physiologically acceptable compounds can include, for example, carbohydrates, such as glucose, sucrose, or dextrans, antioxidants, such as ascorbic acid or glutathione, chelating agents, low molecular weight proteins, compositions that reduce the clearance or hydrolysis of the active agents, or excipients or other stabilizers and/or buffers. Other pharmaceutically acceptable carriers and their formulations are also available in the art.

The pharmaceutical compositions described herein can be manufactured, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, emulsifying, encapsulating, entrapping, or lyophilizing processes. The following methods and excipients are exemplary.

For oral administration, a fusion protein as described herein can be formulated by combining it with pharmaceutically acceptable carriers that are well known in the art. Such carriers enable the fusion protein to be formulated as tablets, pills, dragees, capsules, emulsions, lipophilic and hydrophilic suspensions, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be obtained by mixing the fusion protein with a solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients include, for example, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone. If desired, disintegrating agents can be added, such as a cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate.

As disclosed above, a fusion protein as described herein can be formulated for parenteral administration by injection, e.g., by bolus injection or continuous infusion. For injection, the fusion protein can be formulated into preparations by dissolving, suspending, or emulsifying them in an aqueous or nonaqueous solvent, such as vegetable or other similar oils, synthetic aliphatic acid glycerides, esters of higher aliphatic acids or propylene glycol; and if desired, with conventional additives such as solubilizers, isotonic agents, suspending agents, emulsifying agents, stabilizers, and preservatives. In some embodiments, the fusion protein can be formulated in aqueous solutions, such as physiologically compatible buffers, non-limiting examples of which include Hanks's solution, Ringer's solution, and physiological saline buffer. Formulations for injection can be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. The compositions can take such forms as suspensions, solutions, or emulsions in oily or aqueous vehicles, and can contain formulatory agents such as suspending, stabilizing, and/or dispersing agents.

In some embodiments, a fusion protein as described herein is prepared for delivery in a sustained-release, controlled release, extended-release, timed-release, or delayed-release formulation, for example, in semi-permeable matrices of solid hydrophobic polymers containing the active agent. Various types of sustained-release materials have been established and are well known by those skilled in the art. Extended-release formulations include film-coated tablets, multiparticulate or pellet systems, matrix technologies using hydrophilic or lipophilic materials and wax-based tablets with pore-forming excipients. Usually, sustained release formulations can be prepared using naturally-occurring or synthetic polymers, for instance, polymeric vinyl pyrrolidones, such as polyvinyl pyrrolidone; carboxyvinyl hydrophilic polymers; hydrophobic and/or hydrophilic hydrocolloids, such as methylcellulose, ethylcellulose, hydroxypropyl cellulose, and hydroxypropylmethyl cellulose; and carboxypolymethylene.

Typically, a pharmaceutical composition for use in in vivo administration is sterile. Sterilization can be accomplished according to methods known in the art, e.g., heat sterilization, steam sterilization, sterile filtration, or irradiation.

Dosages and desired drug concentration of pharmaceutical compositions described herein may vary depending on the particular use envisioned. Suitable dosages are also described in Section XII above.

Kits

In some embodiments, a kit for use in treating an LSD, e.g., Hunter syndrome, Sanfilippo syndrome A, Niemann-Pick disease, Gaucher's disease, or Parkinson's disease, comprising a fusion protein as described herein is provided.

In some embodiments, the kit further comprises one or more additional therapeutic agents. For example, in some embodiments, the kit comprises a fusion protein as described herein and further comprises one or more additional therapeutic agents for use in the treatment of neurological symptoms of an LSD. In some embodiments, the kit further comprises instructional materials containing directions (i.e., protocols) for the practice of the methods described herein (e.g., instructions for using the kit for administering a fusion protein comprising the ERT enzyme across the blood-brain barrier). While the instructional materials typically comprise written or printed materials, they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this disclosure. Such media include, but are not limited to, electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD-ROM), and the like. Such media may include addresses to internet sites that provide such instructional materials.

XV. Examples

The present disclosure will be described in greater detail by way of specific examples. The following examples are offered for illustrative purposes only, and are not intended to limit the disclosure in any manner. Those of skill in the art will readily recognize a variety of noncritical parameters which can be changed or modified to yield essentially the same results. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, etc.), but some experimental error and deviation may be present. The practice of the present disclosure will employ, unless otherwise indicated, conventional methods of protein chemistry, biochemistry, recombinant DNA techniques and pharmacology, within the skill of the art. Such techniques are explained fully in the literature. Additionally, it should be apparent to one of skill in the art that the methods for engineering as applied to certain libraries can also be applied to other libraries described herein.

Example 1. Construction of Fusion Proteins Comprising Iduronate 2-Sulfatase (IDS)

Design and Cloning

IDS-Fc fusion proteins were designed that contain (i) a fusion polypeptide where a mature, human IDS enzyme is fused to a human IgG1 fragment that includes the Fc region (an “IDS-Fc fusion polypeptide”), and (ii) a modified human IgG1 fragment which contains mutations in the Fc region that confer transferrin receptor (TfR) binding (a “modified Fc polypeptide”). In particular, IDS-Fc fusion polypeptides were created in which IDS fragments were fused to either the N- or C-terminus of the human IgG1 Fc region. In some cases, a linker was placed between the IDS and IgG1 fragments to alleviate any steric hindrance between the two fragments. In all constructs, the signal peptide from the kappa chain V-III, amino acids 1-20 (UniProtKB ID—P01661) was inserted upstream of the fusion to facilitate secretion, and IDS was truncated to consist of amino acids S26-P550 (UniProtKB ID—P22304). The fragment of the human IgG1 Fc region used corresponds to amino acids D104-K330 of the sequence in UniProtKB ID P01857 (positions 221-447, EU numbering, which includes 10 amino acids of the hinge (positions 221-230)). In some embodiments, a second Fc polypeptide derived from human IgG1 residues D104-K330 but lacking the IDS fusion was co-transfected with the IDS-Fc fusion polypeptide in order to generate heterodimeric fusion proteins with one IDS enzyme (a “monozyme”). In some constructs, the IgG1 fragments contained additional mutations to facilitate heterodimerization of the two Fc regions. Control IDS-Fc fusion proteins that lack the mutations that confer TfR binding were designed and constructed analogously, with the difference being that these proteins lacked the mutations that confer TfR binding. As an additional control, we generated IDS (amino acids S26-P550) with a C-terminal hexahistidine tag (SEQ ID NO:241) to facilitate detection and purification.

The TfR-binding IDS-Fc fusion proteins used in the examples are dimers formed by an IDS-Fc fusion polypeptide and a modified Fc polypeptide that binds to TfR. For dimers where the IDS enzyme is linked to the N-terminus of the Fc region, the IDS-Fc fusion polypeptide may have the sequence of any one of SEQ ID NOS:115, 231, and 235. In these sequences, the IDS sequence is underlined and contains a cysteine at position 59 (double underlined) modified to formylglycine. The IDS was joined to the Fc polypeptide by a GGGGS linker (SEQ ID NO:239). A portion of an IgG1 hinge region (DKTHTCPPCP; SEQ ID NO:113) was included at the N-terminus of the Fc polypeptide. The CH2 domain sequence starts at position 541 of SEQ ID NOS:115, 231, and 235.

The IDS-Fc fusion protein ETV:IDS 35.21 used in the examples is a dimer formed by an IDS-Fc fusion polypeptide having the sequence of any one of SEQ ID NOS:115, 231, and 235 and a modified Fc polypeptide that binds to TfR having the sequence of SEQ ID NO:116. The first 10 amino acids are a portion of an IgG1 hinge region. The CH2 domain sequence starts at position 11 of SEQ ID NO:116.

The IDS-Fc fusion protein ETV:IDS 35.21.17.2 used in the examples is a dimer formed by an IDS-Fc fusion polypeptide having the sequence of any one of SEQ ID NOS:115, 231, and 235 and a modified Fc polypeptide that binds to TfR having the sequence of SEQ ID NO:228. The first 10 amino acids are a portion of an IgG1 hinge region. The CH2 domain sequence starts at position 11 of SEQ ID NO:228.

The IDS-Fc fusion protein ETV:IDS 35.23.2 used in the examples is a dimer formed by an IDS-Fc fusion polypeptide having the sequence of any one of SEQ ID NOS:115, 231, and 235 and a modified Fc polypeptide that binds to TfR having the sequence of SEQ ID NO:229. The first 10 amino acids are a portion of an IgG1 hinge region. The CH2 domain sequence starts at position 11 of SEQ ID NO:229.

The IDS-Fc fusion protein ETV:IDS 35.21.17 used in the examples is a dimer formed by an IDS-Fc fusion polypeptide having the sequence of any one of SEQ ID NOS:115, 231, and 235 and a modified Fc polypeptide that binds to TfR having the sequence of SEQ ID NO:151. The N-terminus of the modified Fc polypeptide may include a portion of an IgG1 hinge region (e.g., SEQ ID NO:113).

Recombinant Protein Expression and Purification

To express recombinant IDS enzyme fused to an Fc region, ExpiCHO cells (Thermo Fisher Scientific) were transfected with relevant DNA constructs using Expifectamine™ CHO transfection kit according to manufacturer's instructions (Thermo Fisher Scientific). Cells were grown in ExpiCHO™ Expression Medium at 37° C., 6% CO₂ and 120 rpm in an orbital shaker (Infors HT Multitron). In brief, logarithmic growing ExpiCHO™ cells were transfected at 6×10⁶ cells/ml density with 0.8 μg of DNA plasmid per mL of culture volume. After transfection, cells were returned to 37° C. and transfected cultures were supplemented with feed as indicated 18-22 hrs post transfection. Transfected cell culture supernatants were harvested 120 hrs post transfection by centrifugation at 3,500 rpm from 20 mins. Clarified supernatants were filtered (0.22 μM membrane) and stored at 4° C. Expression of an epitope-tagged IDS enzyme (used as a control) was carried out as described above with minor modifications. In brief, an IDS enzyme harboring a C-terminal hexahistidine tag (SEQ ID NO:241) was expressed in ExpiCHO cells.

IDS-Fc fusion proteins with (or without) engineered Fc regions conferring TfR binding were purified from cell culture supernatants using Protein A affinity chromatography. Supernatants were loaded onto a HiTrap Mab Select SuRe Protein A affinity column (GE Healthcare Life Sciences using an Akta Pure System). The column was then washed with >20 column volumes (CVs) of PBS. Bound proteins were eluted using 100 mM citrate/NaOH buffer pH 3.0 containing 150 mM NaCl. Immediately after elution, fractions were neutralized using 1 M arginine-670 mM succinate buffer pH 5.0 (at a 1:5 dilution). Homogeneity of IDS-Fe fusion proteins in eluted fractions was assessed by reducing and non-reducing SDS-PAGE.

To purify hexahistadine-tagged (SEQ ID NO:241) IDS enzyme, transfected supernatants were exhaustively dialyzed against 15 L of 20 mM HEPES pH 7.4 containing 100 mM NaCl overnight. Dialyzed supernatants were bound to a HisTrap column (GE Healthcare Life Sciences using an Akta Pure System). After binding, the column was washed with 20 CV of PBS. Bound proteins were eluted using PBS containing 500 mM imidazole. Homogeneity of IDS enzyme in eluted fractions was assessed by reducing and non-reducing SDS-PAGE. Pooled fractions containing IDS enzyme were diluted 1:10 in 50 mM Tris pH 7.5 and further purified using Q Sepharose High Performance (GE Healthcare). After binding, the column was washed with 10 CV of 50 mM Tris pH 7.5. Bound proteins were eluted using a linear gradient to 50 mM Tris pH 7.5 and 0.5 M NaCl and collected in 1 CV fractions. Fraction purity was assessed by non-reducing SDS-PAGE. As shown in FIG. 1, purification yielded homogeneous IDS-Fc fusion proteins and hexahistidine-tagged (SEQ ID NO:241) IDS enzyme.

Example 2. Characterization of IDS Fusion Proteins

IDS-Fc Fusion Proteins with Engineered TfR Binding Site Bind to Human TfR

To determine whether IDS-Fc fusion proteins with engineered TfR binding affects the ability of the modified Fc domain to interact with human TfR, the affinity of this protein for human TfR was assessed using a Biacore™ surface plasmon resonance assay. Biacore™ Series S CM5 sensor chips were immobilized with anti-human Fab (human Fab capture kit from GE Healthcare). 5 μg/mL of the IDS-Fc fusion proteins were captured for 1 minute on each flow cell and serial 3-fold dilutions of human apical domain TfR were injected at a flow rate of 30 μL/min. Each sample was analyzed with a 3-minute association and a 3-minute dissociation. After each injection, the chip was regenerated using 10 mM glycine-HCl (pH 2.1). Binding response was corrected by subtracting the RU from a flow cell capturing an irrelevant IgG at similar density. Steady-state affinities were obtained by fitting the response at equilibrium against the concentration using Biacore™ T200 Evaluation Software v3.1. As shown in FIG. 2, Biacore™ analysis established that the IDS-Fc fusion proteins with a TfR-binding site engineered into the Fc region binds to human TfR. Biacore™ analysis also established that the IDS-Fc fusion protein ETV:IDS 35.21 binds to human TfR with an affinity of ˜200 nM.

IDS-Fc Fusion Proteins with Engineered TfR Binding Site are Active In Vitro, in Cells, and In Vivo

The in vitro and cellular activity of engineered TfR-binding IDS-Fc fusion proteins were assessed to demonstrate that IDS maintains its enzymatic activity when fused to the human IgG fragment. In vitro activity was measured with a two-step fluorometric enzymatic assay using an artificial substrate. Specifically, 20 μL of 1 mM 4-Methylumbelliferyl a-L-idopyranosiduronic acid 2-sulphate disodium salt substrate (Carbosynth Limited, # EM03201) was diluted in the assay buffer (100 mM sodium acetate, 10 mM lead acetate, 0.05% Triton X-100, pH 5.0) and mixed with 10 μL of 0.2 nM IDS. The first reaction was incubated for 4 hr at 37° C. and terminated with 60 μL of 0.2 M phosphate-citrate buffer, pH 5.0. The second reaction was then carried out in the presence of 15 μg cell lysate from HEK 293T cells transiently transfected with human α-iduronidase (IDUA), incubated for 16 hr at 37° C., and stopped with the addition of 100 μL of 0.5 M sodium carbonate buffer, pH 10.5. Fluorescence of the reaction solution was then measured (excitation at 365 nm and emission at 450 nm). A 4-Methylumbelliferone standard curve was fit by linear regression to calculate the amount of product and verified as less than 10% of total substrate cleavage. Specific activity (nmol product/min/nmol IDS) was calculated by dividing the amount of product by the reaction time and molar amount of IDS.

The in vitro enzymatic activity assay demonstrated that IDS-Fc fusion proteins were active and indicated that the fusion of an Fc region to IDS does not interfere with enzymatic activity (FIG. 3).

IDS knockout (KO) cells were generated using CRISPR/CAS9 to provide a cellular system to test the cellular activity of the engineered IDS-Fc fusion proteins. HEK 293T cells (ATCC) were transfected with CRISPR/CAS9 pCas-Guide-EF1a-GFP vector (Origene) containing guide sequences targeted to the second half of exon 1 in human IDS. Single cell clones were analyzed for the presence of indels within the genomic sequence of IDS following Guide-it Mutation Detection Kit (Clontech) per manufacturer instructions. To identify IDS KO cells, indel positive clone cell lysates were analyzed using the in vitro IDS enzyme assay described above. Briefly, the in vitro activity assay was performed using 12.5, 25, 50 and 100 cell lysate in lead acetate assay buffer pH 5.0 (100 mM sodium acetate, 10 mM lead acetate, 0.02% NaAzide) as previously described (Vozyni et al., J. Inherit. Metab. Dis., 24:675-80 (2001)). The reaction was started by combining 10 μL normalized cell lysate (in water) with 1 mM substrate in 20 μL lead acetate buffer. The first reaction was incubated for 4 hr at 37° C. and terminated with 60 μL of 0.2 M phosphate-citrate buffer, pH 5.0. The second reaction was then carried out with the addition of 10 μg/10 μL cell lysate from HEK 293T cells transiently transfected with human α-iduronidase (IDUA) and allowed to proceed for 24 hr at 37° C., and stopped with the addition of 100 μL of 0.5 M sodium carbonate buffer, pH 10.3. Fluorescence of the reaction solution was then measured (excitation at 365 nm and emission at 450 nm). IDS activity in HEK 293T CRISPR clones was compared to recombinant IDS used as an assay standard, HEK wild-type (WT) lysates, and HEK cell lysates over-expressing IDS. Clones with enzyme activity levels comparable to background signal were sequence verified after mini-Topo (ThermoFisher) cloning and confirmed as KO clones. Subsequent cell assays use three unique and verified IDS KO clones and three independent batches of WT HEK 293T cells.

To test the cellular activity of naked IDS enzyme or IDS-Fc fusion proteins, an LC-MS/MS-based glycomic assay was developed that allows monitoring of the amount of substrate accumulation (heparan sulfate and dermatan sulfate) as an indicator of IDS activity. Substrate accumulation was measured in IDS KO cells before and after addition of IDS or IDS-Fc fusion proteins to the cell culture media. Briefly, cells were washed three times with PBS, pelleted, and frozen. Cell pellets were sonicated in disaccharide digestion buffer (111 mM NH₄OAc, 11 mM CaOAc, pH 7.0). Protein concentration was measured using BCA assay (Pierce). Total protein (100 μg) was added to 100 μL digestion buffer with 2 mM DTT, 1.25 mIU Heparinase I (Galen), 1.25 mIU Heparinase II (Galen), 1.25 mIU Heparinase III (Galen), and 6.25 mIU Chondroitinase B (Galen). Heparan sulfate and dermatan sulfate digestion was complete after three hours at 30° C., after which 20 ng of internal standard (4UA-25-G1cNCOEt-65 HD009 [Galen]) was added to each sample. The enzymes were deactivated by the addition of 6 μL of 250 mM EDTA and samples were boiled at 95° C. for 10 minutes. Samples were then centrifuged at 16,000×G for 5 minutes at room temperature. Supernatant was transferred to an Amicon Ultra 30 KD centrifugal filter (Millipore) and centrifuged at 14,000×G for 15 minutes. Disaccharides were concentrated in the flow through and were resuspended in a mixture of [1:1, v/v] assay buffer:acetonitrile which was then transferred to mass-spectrometry vials for further analysis.

Analysis of disaccharides generated by enzymatic digestion of heparan and dermatan sulfate was performed by liquid chromatography (Shimadzu Nexera X2 system, Shimadzu Scientific Instrument, Columbia, Md., USA) coupled to electrospray mass spectrometry (Sciex 6500+ QTRAP, Sciex, Framingham, Mass., USA). For each analysis, 10 μL of sample was injected on an ACQUITY UPLC BEH Amide 1.7 μm, 2.1×150 mm column (Waters Corporation, Milford, Mass., USA) using a flow rate of 0.4 mL/min with column temperature at 50° C. Mobile phase A consisted of water with 10 mM ammonium formate and 0.1% formic acid. Mobile phase B consisted of acetonitrile with 0.1% formic acid. The gradient was programmed as follows: 0.0-1.0 min at 85% B, 1.0-5.0 min from 85% B to 50% B, 5.0-6.0 min 50% B to 85% B, 6-8.0 min hold at 85% B. Electrospray ionization was performed in the negative-ion mode applying the following settings: curtain gas at 30; collision gas was set at medium; ion spray voltage at −4500; temperature at 450; ion source gas 1 at 50; ion source gas 2 at 60. Data acquisition was performed using Analyst 1.6.3 (Sciex) in multiple reaction monitoring mode (MRM), with dwell time 25 (msec). Collision energy at −30; declustering potential at −80; entrance potential at −10; collision cell exit potential at −10. GAGs were detected as [M−H]− using the following MRM transitions: D0A0 at m/z 378.1>87.0; D0a0 at m/z 378.1>175.0; D0S0 at m/z 416.1>138.0; D0a4 at m/z 458.1>300.0; D0A6,D2A0, D0a6, D2a0 at m/z 458.1>97.0; D0S6, D2S0 at m/z 496.0>416.1; D2a4, D2a6, D0a10, D2A6 at m/z 538.0>458.0; D0S6 at m/z 575.95>97.0 4UA-2S-G1cNCOEt-6S at m/z 472.0 (fragment ion) >97.0 was used as internal standard (I.S.). GAGs were identified based on their retention times and MRM transitions match to commercially available reference standards (Iduron Ltd, Manchester, UK). Quantification was performed using MultiQuant 3.0.2 (Sciex) by the area ratio to I.S. GAGs were normalized to total protein amount. Protein concentration was measured using BCA assay (Pierce).

Significant substrate accumulation, as reflected by the amount of disaccharides observed after digestion of heparan sulfate and dermatan sulfate, was seen in IDS KO cells compared to control cell lines, an effect that could be rescued with addition of recombinant IDS to the cells (FIG. 4). This validated that the LC-MS/MS-based assay can be used to assess the cellular activity of IDS and IDS-Fc fusion proteins.

Using this assay, it was established that treatment of cells with IDS-Fc fusion proteins as either an N-terminal monozyme (i.e., ETV:IDS 35.21.17) or a C-terminal monozyme comprising the same TfR-binding Fc polypeptide (i.e., CH3C.35.21.17) reduced the levels of heparan and dermatan sulfate-derived disaccharides back to that seen in wild-type cells (FIG. 5A). In addition, the activity of the N-terminal monozyme was comparable to IDS (FIG. 5B). Together, these data demonstrate that IDS-Fc fusion proteins maintain enzymatic activity and can reduce substrate accumulation in IDS KO cells.

The cellular activity of IDS-Fc fusion proteins was also examined in fibroblasts from MPS II patients and healthy controls using a ³⁵S pulse-chase assay, in which ³⁵S is integrated into newly-synthesized GAGs, as previously described (Lu et al., Bioconjugate Chemistry, 21:151-156 (2010)). MPS II patient fibroblasts lack detectable IDS activity, leading to an approximate 10-fold accumulation of substrate and 2.5-fold accumulation of ³⁵S signal (FIG. 5C). Similar to the IDS KO cells, IDS-Fc fusion proteins such as ETV:IDS 35.23.2 were highly efficacious in MPS II patient-derived cells, displaying a low picomolar cellular EC₅₀ for reducing the accumulation of S³⁵-labeled proteins (FIG. 5C). Furthermore, cellular activity of IDS-Fc fusion proteins such as ETV:IDS 35.23.2 was demonstrated to be M6PR-dependent, as an excess of M6P inhibited the clearance of ³⁵S-labeled proteins in treated MPS II patient fibroblasts (FIG. 5D). Collectively, these data demonstrate that the M6PR-dependent trafficking and cellular activity of IDS can be maintained in the IDS-Fc fusion protein format.

To measure heparan and dermatan sulfate-derived disaccharides in vivo, the LC-MS/MS-based glycomics assay was adapted for analysis from tissues and fluids. Briefly, all tissues and fluids were collected and then immediately frozen and stored at −80° C. Samples were subjected to 5 freeze-thaw cycles and processed as described above for cell analysis. Significant accumulation of heparan sulfate and dermatan sulfate-derived disaccharides was seen in all tissues and fluids analyzed from male IDS KO mice compared to male wild-type littermate controls (Table 1). This assay is used for efficacy studies of the fusion proteins in vivo. IDS KO mice were obtained from The Jackson Laboratories (JAX strain 024744).

TABLE 1 Glycomic analysis of tissue and fluids from IDS KO mice. Amount of total HS and DS-derived disaccharides Sample Fold change (ng/100 ug protein) Type (over WT) WT IDS KO Liver 86.3  6.3 (+/−0.4) 543.6 (+/−14.7) Kidney 14.1 29.3 (+/−0.9) 413.4 (+/−6.1)  Lung 9.0 40.3 (+/−6.9) 362.6 (+/−37.0) Brain 8.8  5.8 (+/−0.5) 51.3 (+/−2.5) Spleen 7.4 19.5 (+/−5.8) 145.2 (+/−5.3)  Plasma 49.1  0.4 (+/−0.03) 20.0 (+/−5.1) Serum 43.1  0.5 (+/−0.03) 23.5 (+/−4.8) CSF 20.1  0.04 (+/−0.007)  0.78 (+/−0.05) Urine 13.8 38.83* (+/−1.4)  537.3* (+/−92.9) 

Using this method, the levels of heparan and dermatan sulfate-derived disaccharides were assessed in serum from wild-type (WT) mice dosed with vehicle and IDS KO mice dosed with IDS or an IDS-Fc fusion protein (i.e., ETV:IDS 35.21). Baseline measurements prior to dosing demonstrated significant accumulation of heparan and dermatan sulfate-derived disaccharides in serum from IDS KO mice compared to WT mice. After dosing with the IDS-Fc fusion protein, the levels of heparan and dermatan sulfate-derived disaccharides were significantly reduced in IDS KO serum, showing a comparable reduction as that seen in serum from IDS KO mice dosed with IDS (FIG. 6). These data demonstrate that the IDS-Fc fusion protein is active in vivo and can reduce substrate accumulation in IDS KO mice. Based on these data, the distribution and pharmacodynamic (PD) response was assessed in tissues of IDS KO mice seven days after a single dose of the IDS-Fc fusion protein. IDS KO mice were intravenously administered 40 mg/kg of IDS-Fc fusion protein or 5.3 mg/kg of IDS (25% molar equivalent dose) as a positive control, and GAG levels were assessed. Distribution of both molecules in peripheral tissues was confirmed at two hours post-dose. Significant substrate reduction was observed in the liver, spleen, and lung of IDS KO mice seven days after dosing with the IDS-Fc fusion protein (FIG. 7).

To determine whether TfR-binding IDS-Fc fusion proteins showed improved brain delivery compared to a control IDS-Fc fusion protein, human TfR knock-in (TfR^(ms/hu) KI) mice were dosed with 50 mg/kg of the TfR-binding IDS-Fc fusion protein ETV:IDS 35.21 or a control IDS-Fc fusion protein lacking the mutations that confer TfR binding (“IDS:Fc”), and the concentration of the IDS-Fc fusion protein in brain was measured using a sandwich ELISA-based assay described in Example 3 below at 4 hours post-dose. TfR^(ms/hu) KI mice were generated as described in International Patent Publication No. WO 2018/152285 using CRISPR/Cas9 technology to express human Tfrc apical domain within the murine Tfrc gene; the resulting chimeric TfR was expressed in vivo under the control of the endogenous promoter. Significantly higher levels of the IDS-Fc fusion protein ETV:IDS 35.21 were detected in brain compared to the control IDS-Fc fusion protein, with an average brain concentration of 23.7 nM for ETV:IDS 35.21 (FIG. 8). The brain uptake of two additional TfR-binding IDS-Fc fusion proteins, ETV:IDS 35.21.17.2 and ETV:IDS 35.23.2, was assessed using the TfR^(ms/hu) KI mice. TfR^(ms/hu) KI mice were dosed with 50 mg/kg of ETV:IDS 35.21.17.2, ETV:IDS 35.23.2, or the control IDS-Fc fusion protein (“IDS:Fc”), and the concentration of the IDS-Fc fusion protein in brain was measured using the sandwich ELISA-based assay at 2 hours and 8 hours post-dose. Administration of the TfR-binding IDS-Fc fusion proteins led to a 5-fold increase in brain uptake relative to the control IDS-Fc fusion protein at 2 hours and a 10-20 fold increase in brain concentration at 8 hours post-dose (FIG. 9A). Serum PK and accumulation of the intact fusion protein in the liver was equivalent for both ETV:IDS 35.21 and IDS:Fc (FIG. 9B), indicating that the IDS moiety largely determines the distribution phase of the plasma clearance. Brain levels of TfR-binding IDS-Fc fusion proteins remained elevated for eight hours, decreasing slightly with peripheral clearance. Together, these data demonstrate that the interaction of the TfR-binding IDS-Fc fusion proteins with TfR generally maintains peripheral distribution while significantly improving brain exposure.

Intravenous Administration of ETV:IDS Reduces GAGs in the Brain

To examine whether the improved brain exposure observed with the TfR-binding IDS-Fc fusion proteins described above and prepared in accordance with Example 1 (referred to herein as ETV:IDS) produced a corresponding reduction of accumulated substrates in the brain, a mouse model deficient for IDS that harbors the human TfR apical domain knocked into the murine TfR was generated (referred to herein as IDS KO×TfR^(ms/hu) KI mice). Briefly, TfR^(ms/hu) KI male mice were bred to female IDS heterozygous mice to generate IDS KO mice in a TfR^(ms/hu) KI homozygous background. All mice used in this study were males and housed under a 12 hour light-dark cycle with ad libitum access to food (LabDiet IL irradiated 6F) and water.

IDS KO×TfR^(ms/hu) KI mice were intravenously administered either single or four weekly activity-equivalent doses of ETV:IDS or IDS (747 nmol product/min/kg or 40 mg/kg and 14.2 mg/kg, respectively), and pharmacokinetic and pharmacodynamic responses were assessed. In particular, the effect of peripheral administration of ETV:IDS on brain and tissue GAG in IDS KO×TfR^(ms/hu) KI mice was determined using 2-month-old IDS KO×TfR^(ms/hu) KI mice injected intravenously (i.v.) with saline, IDS (14.2 mg/kg body weight), or ETV:IDS (40 mg/kg body weight) either once (n=8) or once every week for 4 weeks (n=8). 2-month-old littermate TfR^(ms/hu) KI mice, injected i.v. with saline either once (n=5) or once every week for 4 weeks (n=5), were used as controls. For animals dosed with IDS or ETV:IDS, in-life serum samples were collected by submandibular bleed at various time points. All animals were sacrificed either 7 days post single dose or 7 days following last 4 week dose. Urine, serum, CSF, liver, kidney, spleen, lung, heart and right hemibrain were dissected and flash-frozen on dry ice.

Following a single dose, ETV:IDS exhibited a similar serum clearance profile as IDS, as assessed using an ELISA-based assay described in Example 3 below for detecting the concentration of IDS, providing additional support that the enzyme largely dictates peripheral clearance (FIG. 10A). Brain levels of ETV:IDS were significantly increased compared to IDS two hours post-dose, with a mean concentration of 8.4 and 1.6 nM, respectively, and liver and spleen levels of ETV:IDS were significantly elevated compared to IDS (FIG. 10B).

To determine whether ETV:IDS reduces substrate levels in the brain, GAG levels were assessed as described in Example 2 in IDS KO×TfR^(ms/hu) KI mice after a single dose or four, weekly doses of enzyme. IDS marginally decreased brain GAG levels at early time points but was ineffective at significantly lowering GAGs after four weeks of treatment (FIG. 10C). ETV:IDS, however, reduced brain GAG levels by approximately 58% following a single dose and 71% following four weeks of treatment (FIG. 10C). This led to a concomitant reduction of CSF GAGs by approximately 75% after a single dose which was sustained after four weeks of dosing (FIG. 10C). Both molecules effectively lowered GAG levels in liver and spleen after one week, and the response was sustained with repeat dosing (FIG. 10C), demonstrating that TfR binding does not negatively impact pharmacodynamic responses in these tissues. Together, these data demonstrate that ETV:IDS significantly increases brain exposure of enzyme and robustly reduces substrate accumulation in both the periphery and CNS.

Example 3. Pharmacokinetic Characterization of IDS Fusion Proteins

This example describes pharmacokinetic (PK) characterization of engineered IDS-Fc fusion proteins in mouse plasma.

To determine the plasma half-life and clearance of TfR-binding IDS-Fc fusion proteins, 7-8 week old male C57BL/6 mice were dosed with 10 mg/kg of two IDS-Fc fusion protein molecules (an N-terminal monozyme and a C-terminal monozyme) via tail vein injection. The concentration of IDS-Fc fusion proteins remaining in plasma over a 24-hour period was measured using an ELISA-based assay. Briefly, the concentration of IDS-Fc fusion proteins in mouse plasma was quantified using a sandwich ELISA. An anti-Fc capture antibody (Abcam # ab124055) was coated onto a 384-well MaxiSorp™ plate (Thermo Scientific #464718) at 3 μg/mL. The plate was blocked with 5% BSA and then incubated with plasma diluted either 1:1,000 or 1:10,000. Next, a polyclonal anti-IDS detection antibody (R&D Systems # AF2449) was added at 0.5 μg/mL followed by an anti-goat-HRP antibody. The plates were developed using TMB substrate, stopped with sulfuric acid, and the absorbance at 450 nm measured on a BioTek plate reader. The standard curves were the individual constructs from 200-0.1 ng/mL in a 4-fold dilution series and were fit using a four-parameter logistic regression.

Using this assay, it was established that the terminal plasma half-life of IDS-Fc fusion proteins was 7.7-10 hrs (Table 2). No unanticipated PK liabilities were seen with IDS-Fc fusion proteins in vivo.

TABLE 2 Pharmacokinetics of IDS-Fc fusion proteins in mice over 24 hours. ETV:IDS Dose CL CL Terminal t_(1/2) Description (mg/kg) (ml/h/kg) (ml/d/kg) (hrs) N-terminal 10 22.0 528 10 monozyme C-terminal 10 42.6 1020 7.7 monozyme

Example 4. Construction of Fusion Proteins Comprising Acid Sphingomyelinase (ASM)

Design and Cloning

ASM-Fc fusion proteins were designed as dimers of a fusion polypeptide where a mature, human ASM enzyme is fused to a human IgG1 fragment that includes the Fc region (an “ASM-Fc fusion polypeptide”). In some embodiments, an ASM-Fc fusion polypeptide comprises a modified Fc region containing mutations that confer transferrin receptor (TfR) binding. In particular, ASM-Fc fusion polypeptides were created in which ASM fragments were fused to the N-terminus of the human IgG1 Fc region. In some cases, a linker was placed between the ASM and IgG1 fragments to alleviate any steric hindrance between the two fragments. In all constructs, the native ASM signal sequence, amino acids 1-46 (UniProtKB ID—P17405), was removed and replaced with a secretion signal from the kappa chain V-III, amino acids 1-20 (UniProtKB ID—P01661), to improve secretion of ASM. Additionally, in the fusion proteins, ASM was truncated at its C-terminus, ending at amino acid Q620, to prevent any unwanted cleavage between ASM and the human IgG1 Fc region. A fragment of the human IgG1 Fc region (UniProtKB ID—P01857) was then placed in frame with the C-terminus of ASM beginning at amino acid E99, with the cysteine at position 103 being mutated to serine. In some embodiments, the IgG1 fragments contained additional mutations to facilitate heterodimerization of the two Fc regions. Additionally, ASM-Fc fusion proteins were generated containing one or two molecules of ASM. As a control, ASM-hexahistidine (SEQ ID NO:241) fusion proteins were designed consisting of ASM amino acids 1-628, truncated to remove the C-terminal cysteine and promote enzymatic activation, and a C-terminally fused hexahistidine tag (SEQ ID NO:241).

Recombinant Protein Expression and Purification

To express recombinant ASM enzyme fused to an Fc region, ExpiCHO-S cells (Thermo Fisher) were transfected at 6×10⁶ cells/ml density with Expifectamine CHO/plasmid DNA complex according to manufacturer's instructions (Thermo Fisher Scientific). After transfection, cells were incubated at 32° C. with a humidified atmosphere of 6-8% CO₂ in an orbital shaker (Infors HT Multitron). On day one post-transfection, Expifectamine enhancer and Expifectamine feed were added to the culture. Media supernatant was harvested by centrifugation after 48-72 hour expression time. The clarified supernatant was supplemented with EDTA-free protease inhibitor (Roche) and was stored at −80° C.

For ASM-Fc fusion protein purification, clarified media supernatant was supplemented with 200 μM zinc acetate (Sigma Aldrich). The supernatant was loaded on HiTrap Mab Select SuRe Protein A affinity column (GE Healthcare Life Sciences) and washed with 200 mM arginine and 137 mM succinate buffer pH 5.0 (arginine-succinate buffer). The fusion proteins were eluted in 100 mM QB citrate buffer pH 3.0 supplemented with 200 μM zinc acetate. Immediately after elution, the arginine-succinate buffer was added to adjust the pH. Protein aggregates were separated from ASM-Fc fusion proteins by size exclusion chromatography (SEC) on Superdex 200 increase 10/300 GL column (GE Healthcare Life Sciences). The SEC mobile phase was kept in arginine-succinate pH 5.0 buffer supplemented with 200 μM zinc acetate. All chromatography steps were performed on using an Akta Pure System or Akta Avant system (GE Healthcare Life Sciences). Fraction purity was assessed by non-reducing SD S-PAGE. As shown in FIG. 11, purification yielded homogeneous ASM-Fc fusion proteins.

Example 5. Characterization of ASM Fusion Proteins

ASH-Fc Fusion Proteins are Active In Vitro and in Cells

To demonstrate that ASM maintains its enzymatic activity when fused to the human IgG heavy chain, the in vitro and cellular activity of ASM-Fc fusion proteins were assessed. In vitro activity of recombinant ASM enzyme or recombinant ASM-Fc fusion proteins were measured using a synthetic chromogenic analog of sphingomyelin. Specifically, 2.5 mM 2-(N-hexadecanoylamino)-4-nitrophenylphosphorylcholine (EMD Millipore) was mixed with 0.75 nM ASM in 100 mM sodium acetate buffer (pH 5.3; final concentrations in 100 μL reaction volume). The reaction was incubated for 16 hr at 37 and stopped with the addition of an equal volume of 0.2 M NaOH. Absorbance of the reaction solution was then measured at 410 nm. A p-nitrophenol standard curve was fit by linear regression to calculate the amount of product and verified as less than 10% of total substrate cleavage, Specific activity (nmol product/min/nmol ASM) was calculated by dividing the amount of product by the reaction time and molar amount of ASM. The in vitro enzymatic activity assay demonstrated that ASM-Fc fusion proteins are active and indicate that fusion of an Fc region to ASM does not interfere with its enzymatic activity (FIG. 12).

ASM KO cells were generated using CRISPR/CAS9 to provide a cellular system to test the cellular activity of ASM-Fc fusion proteins. HEK 293T cells (ATCC) were transfected with CRISPR/CAS9 pCas-Guide-EF1a-GFP vector (Origene) containing guide sequences targeted to the second half of exon 2 in human SMPD1. Single cell clones were analyzed for the presence of indels within the genomic sequence of ASM following Guide-it Mutation Detection Kit (Clontech) per manufacturer's instructions. Indel positive clone cell lysates were subjected to an in vitro ASM enzyme assay using the ASM chromogenic substrate 2-N-Hexadecanoylamino-4-nitrophenylphosphorylcholine (EMD Millipore). Briefly, the in vitro activity assay was performed using 12.5, 25, 50 and 100 μs cell lysate in 100 mM sodium acetate buffer (pH 5.3). The reaction was started by the addition of 2.5 mM substrate and stopped after 20 hours with addition of 0.2 M NaOH. ASM activity in HEK293T CRISPR clones was compared to recombinant ASM used as an assay standard, HEK wild-type (WT) lysates, and HEK cell lysates overexpressing ASM. Clones with enzyme activity levels comparable to background signal were sequence verified after mini-Topo (Thermo Fisher Scientific) cloning and confirmed as KO clones. Subsequent cell assays used three unique and verified ASM KO clones and three independent batches of WT HEK293T cells.

To test the cellular activity of naked ASM enzyme or ASM-Fc fusion proteins, two cellular assays were developed that allow monitoring of the amount of substrate accumulation (sphingomyelin) in ASM KO cells basally and after treatment with ASM or ASM-Fc fusions. First, an imaging-based assay was developed to monitor the amount of BODIPY-conjugated C5-sphingomyelin accumulation in ASMKO cells. Briefly, HEK293T WT and ASMKO cells were plated in DMEM supplement with 10% FBS (Gibco) at low density onto PDL-coated, 96-well plates (Perkin Elmer). Four hours post-plating, recombinant ASM enzyme, ASM-Fc fusion proteins, or control buffer were added to each well and incubated for 48 hours at 37° C. Media was removed, replaced with fresh media containing 1 μM BODIPY-C5-sphingomyelin (Thermo Fisher Scientific), and incubated at 37° C. for 16 hours. Cells were then washed with PBS, fixed with 4% paraformaldehyde, and stained with nuclear (DAPI, Thermo Fisher) and cytoplasmic (far red cell mask, Thermo Fisher Scientific) stains. Images were acquired on the Opera Phenix confocal microscope (Perkin Elmer) with a 63× objective with multiple fields per well and triplicate wells per condition. Image analysis was performed using Harmony software (Perkin Elmer) that detects and analyzes the average total intensity, puncta number, and puncta intensity of the BODIPY-C5-sphingomylein on a per cell basis. These per cell values were then averaged into a per well value that was used to analyze the effect of genotype and/or treatment on the accumulation of BODIPY-C5-sphingomyelin. Significant BODIPY-C5-sphingomyelin accumulation was seen in ASM KO cells compared to control cell lines, an effect that could be rescued with the addition of recombinant ASM enzyme and ASM-Fc fusion proteins (FIG. 13).

To further verify that ASM-Fc fusion proteins maintain their activity in cells, an LC-MS/MS-based assay was developed to monitor the accumulation of endogenous sphingomyelin in ASMKO cells. HEK293T WT and ASMKO cells were cultured and treated with enzyme as described above. At 68 hours post-plating, with or without ASM or ASM-Fc fusion protein treatment, cells were washed thoroughly with PBS, and lipids were extracted with a mixture of water:methanol [1:1, v/v] spiked with appropriate internal standards. Lipids were extracted using methyl-tert-butyl ether (MTBE), vortexed, and centrifuged at 10,000×g and 4° C. for 10 min. The upper MTBE fraction containing lipids was then evaporated to dryness under gentle nitrogen stream. Lipids were resuspended in a mixture of isopropanol:acetonitrile:water [2:1:1, v/v/v] and then transferred to mass-spectrometry vials for further analysis.

Lipid analyses were performed by liquid chromatography (Shimadzu Nexera X2 system, Shimadzu Scientific Instrument, Columbia, Md., USA) coupled to electrospray mass spectrometry (Sciex 6500+ QTRAP, Sciex, Framingham, Mass., USA). For each analysis, 5 of sample was injected on a BEH C18 1.7 μm, 2.1×100 mm column (Waters Corporation, Milford, Mass., USA) using a flow rate of 0.25 mL/min at 55° C. Mobile phase A consisted of 60:40 acetonitrile/water (v/v) with 10 mM ammonium formate+0.1% formic acid. Mobile phase B consisted of 90:10 isopropanol/acetonitrile (v/v) with 10 mM ammonium formate+0.1% formic acid. The gradient was programmed as follows: 0.0-8.0 min from 45% B to 99% B, 8.0-10.0 min at 99% B, 10.0-10.1 min to 45% B, and 10.1-12.0 min at 45% B. Electrospray ionization was performed in the positive-ion mode applying the following settings: curtain gas at 20; collision gas was set at medium; ion spray voltage at 5200; temperature at 250; ion source gas 1 at 50; ion source gas 2 at 60. Data acquisition was performed using Analyst 1.6 (Sciex) in multiple reaction monitoring mode (MRM). Collision energy at 40; declustering potential at 80; entrance potential at 10; collision cell exit potential at 12.5. Ceramides (Cer) were detected as [M−H2O+H]⁺ using the following MRM transitions: Cer d18:1/16:0 at m/z 538.5>264.3; Cer d18:1/18:0 at m/z 566.6>264.3; Cer d18:1/20:0 at m/z 594.6>264.3; Cer d18:1/22:0 at m/z 622.6>264.3; Cer d18:1/24:0 at m/z 650.6>264.3; Cer d18:1/24:1 at m/z 648.6>264.3; Cer d18:1/17:0 at m/z 552.4>264.3 was used as internal standard. Sphingomyelins (SM) were detected as [M+H]⁺ using the following MRM transitions: SM d18:1/16:0 at m/z 703.7>184.1; SM d18:1/18:0 at m/z 731.7>184.1; SM d18:1/20:0 at m/z 759.7>184.1; SM d18:1/22:0 at m/z 787.7>184.1; SM d18:1/24:0 at m/z 815.7>184.1; SM d18:1/24:1 at m/z 813.7>184.1; SM d18:1/18:1 (d9) at m/z 738.7>184.1 was used as internal standard. Lipids were identified based on their retention times and MRM properties of commercially available reference standards (Avanti Polar Lipids, Birmingham, Ala., USA). Quantification was performed using MultiQuant 3.02 (Sciex). Lipids were normalized to total protein amount. Protein concentration was measured using BCA assay (Pierce). LC-MS/MS analysis demonstrated that ASM-Fc fusion proteins can reduce the levels of endogenous sphingomyelin in ASM KO cells back to that seen in wild-type cells (FIG. 14). Further, ASM-Fc fusion proteins were as potent as naked ASM enzyme in reducing sphingomyelin in both assays (Table 3).

TABLE 3 ASM-Fc fusion proteins show similar potency to ASM in cellular assays. EC₅₀ EC₅₀ Molecule (LC/MS-MS) (Imaging assay) ASM 0.32 nM 0.47 nM ASM-Fc 0.25 nM 0.22 nM

Together, these data demonstrate that ASM-Fc fusion proteins retain their activity and can rescue substrate accumulation in ASM-deficient cells.

Example 6. Construction of Fusion Proteins Comprising N-Sulfoglucosamine Sulfohydrolase (SGSH)

Design and Cloning

SGSH-Fc fusion proteins were designed that contain (i) a fusion polypeptide where a mature, human SGSH enzyme is fused to a human IgG1 fragment that includes the Fc region (an “SGSH-Fc fusion polypeptide”), and (ii) a modified human IgG1 fragment which contains mutations in the Fc region that confer transferrin receptor (TfR) binding (a “modified Fc polypeptide”). In particular, SGSH-Fc fusion polypeptides were created in which SGSH fragments were fused to either the N- or C-terminus of the human IgG1 Fc region. In some cases, a linker was placed between the SGSH and IgG1 fragments to alleviate any steric hindrance between the two fragments. In all constructs, the signal peptide from the kappa chain V-III, amino acids 1-20 (UniProtKB ID—P01661) was inserted upstream of the fusion to facilitate secretion, and SGSH was truncated to consist of amino acids R21-L502 (UniProtKB ID—P51688). The fragment of the human IgG1 Fc region used corresponds to amino acids D104-K330 of the sequence in UniProtKB ID P01857 (positions 221-447, EU numbering, which includes 10 amino acids of the hinge (positions 221-230)). In some embodiments, a second Fc polypeptide derived from human IgG1 residues D104-K330 containing mutations in the Fc region conferring TfR binding but lacking the SGSH fusion was co-transfected with the SGSH-Fc fusion polypeptide in order to generate heterodimeric fusion proteins with one SGSH enzyme (a “monozyme”). In other embodiments, a second Fc polypeptide derived from human IgG1 residues D104-K330 containing mutations in the Fc region conferring TfR binding and fused to SGSH was co-transfected with the SGSH-Fc fusion polypeptide in order to generate heterodimeric fusion proteins with two SGSH enzymes (a “bizyme”). In some constructs, the IgG1 fragments contained additional mutations to facilitate heterodimerization of the two Fc regions. Control SGSH-Fc fusion proteins that lack the mutations that confer TfR binding were designed and constructed analogously. As an additional control, SGSH (amino acids R21-L502) was generated with a C-terminal hexahistidine tag (SEQ ID NO:241) to facilitate detection and purification.

The SGSH-Fc fusion proteins comprising TfR-binding used in the examples are dimers formed by an SGSH-Fc fusion polypeptide and a modified Fc polypeptide that binds to TfR, wherein the modified Fc polypeptide lacks the SGSH fusion (a “monozyme”) or is fused to a second SGSH molecule (a “bizyme”).

An SGSH-Fc fusion polypeptide comprising a mature human SGSH sequence fused to the N-terminus of an IgG1 Fc polypeptide sequence with hole and LALA mutations has the sequence of SEQ ID NO:149. The SGSH enzyme was joined to the Fc polypeptide by a GGGGS linker (SEQ ID NO:239) and the N-terminus of the Fc polypeptide included a portion of an IgG1 hinge region (DKTHTCPPCP; SEQ ID NO:113).

An SGSH-Fc fusion polypeptide comprising a mature human SGSH sequence fused to the C-terminus of an IgG1 Fc polypeptide sequence with hole and LALA mutations has the sequence of SEQ ID NO:150. The SGSH enzyme was joined to the Fc polypeptide by a GGGGS linker (SEQ ID NO:239) and the N-terminus of the Fc polypeptide may include a portion of an IgG1 hinge region (e.g., SEQ ID NO:113).

A modified Fc polypeptide that binds to TfR comprising the sequence of clone CH3C.35.21.17 (SEQ ID NO:58) with knob and LALA mutations has the sequence of SEQ ID NO:151. The N-terminus of the modified Fc polypeptide may include a portion of an IgG1 hinge region (e.g., SEQ ID NO:113).

An “N-terminal monozyme” containing a single SGSH molecule at the N-terminus of the Fc polypeptide was formed between SEQ ID NOS:149 and 151. A “C-terminal monozyme” containing a single SGSH molecule at the C-terminus of the Fc polypeptide was formed between SEQ ID NOS:150 and 151.

A modified Fc polypeptide that binds to TfR comprising a mature human SGSH sequence fused to the N-terminus of the sequence of clone CH3C.35.21.17 (SEQ ID NO:58) with knob and LALA mutations has the sequence of SEQ ID NO:154. The SGSH enzyme was joined to the modified Fc polypeptide by a GGGGS linker (SEQ ID NO:239) and the N-terminus of the modified Fc polypeptide included a portion of an IgG1 hinge region (SEQ ID NO:113).

An “N-terminal bizyme” containing a first SGSH molecule at the N-terminus of the Fc polypeptide and a second SGSH molecule at the N-terminus of the modified Fc polypeptide was formed between SEQ ID NOS:149 and 154.

Recombinant Protein Expression and Purification

To express recombinant SGSH enzyme fused to an Fc region, ExpiCHO cells (Thermo Fisher Scientific) were transfected with relevant DNA constructs using Expifectamine™ CHO transfection kit according to manufacturer's instructions (Thermo Fisher Scientific). Cells were grown in ExpiCHO™ Expression Medium at 37° C., 6% CO₂ and 120 rpm in an orbital shaker (Infors HT Multitron). In brief, logarithmic growing ExpiCHO™ cells were transfected at 6×10⁶ cells/ml density with 0.8 μg of DNA plasmid per mL of culture volume. After transfection, cells were returned to 37° C. and transfected cultures were supplemented with feed as indicated 18-22 hrs post transfection. Transfected cell culture supernatants were harvested 120 hrs post transfection by centrifugation at 3,500 rpm from 20 mins. Clarified supernatants were filtered (0.22 μM membrane) and stored at 4° C. Expression of an epitope-tagged SGSH enzyme (used as a control) was carried out as described above with minor modifications. In brief, an SGSH enzyme harboring a C-terminal hexahistidine tag (SEQ ID NO:241) was expressed in ExpiCHO cells.

SGSH-Fc fusion proteins with (or without) engineered Fc regions conferring TfR binding were purified from cell culture supernatants using Protein A affinity chromatography. Supernatants were loaded onto a HiTrap Mab Select SuRe Protein A affinity column (GE Healthcare Life Sciences using an Akta Pure System). The column was then washed with >20 column volumes (CVs) of PBS. Bound proteins were eluted using 100 mM citrate/NaOH buffer pH 3.0 containing 150 mM NaCl. Immediately after elution, fractions were neutralized using 1 M arginine-670 mM succinate buffer pH 5.0 (at a 1:5 dilution). Homogeneity of SGSH-Fc fusions in eluted fractions was assessed by reducing and non-reducing SDS-PAGE.

To purify hexahistidine-tagged (SEQ ID NO:241) SGSH, transfected supernatants were exhaustively dialyzed against 15 L of 20 mM HEPES pH 7.4 containing 100 mM NaCl overnight, and 20 mM imidazole was added to the dialyzed supernatants prior to purification. Dialyzed supernatants were bound to a HisTrap column (GE Healthcare Life Sciences using an Akta Pure System). After binding, the column was washed with 20 CV of PBS. Bound proteins were eluted using PBS containing 500 mM imidazole. Homogeneity of SGSH enzyme in eluted fractions was assessed by reducing and non-reducing SDS-PAGE. Pooled fractions containing SGSH can be diluted 1:10 in 50 mM Tris pH 7.5 and further purified using Q Sepharose High Performance (GE Healthcare). After binding, the column is washed with 10 CV of 50 mM Tris pH 7.5. Bound proteins are eluted using a linear gradient to 50 mM Tris pH 7.5 and 0.5 M NaCl and collected in 1 CV fractions. Fraction purity is assessed by non-reducing SDS-PAGE. Purification yields homogeneous SGSH-Fc fusion proteins and hexahistidine-tagged (SEQ ID NO:241) SGSH.

Example 7. Characterization of SGSH Fusion Proteins

SGSH-Fc Fusion Proteins with Engineered TfR Binding Site Bind to Human TfR

To determine whether SGSH-Fc fusion proteins with engineered TfR binding affects the ability of the modified Fc domain to interact with human TfR, the affinity of this protein for human TfR can be assessed using a Biacore™ surface plasmon resonance assay. Biacore™ Series S CM5 sensor chips are immobilized with anti-human Fab (human Fab capture kit from GE Healthcare). 5 μg/mL of the SGSH-Fc fusion proteins are captured for 1 minute on each flow cell and serial 3-fold dilutions of human apical domain TfR are injected at a flow rate of 30 μL/min. Each sample is analyzed with a 3-minute association and a 3-minute dissociation. After each injection, the chip is regenerated using 10 mM glycine-HCl (pH 2.1). Binding response is corrected by subtracting the RU from a flow cell capturing an irrelevant IgG at similar density. Steady-state affinities are obtained by fitting the response at equilibrium against the concentration using Biacore™ T200 Evaluation Software v3.1. Biacore™ analysis establishes that SGSH-Fc fusion proteins with a TfR-binding site engineered into the Fc region bind to human TfR.

SGSH-Fc Fusion Proteins with Engineered TfR Binding Site are Active In Vitro and in Cells

The in vitro and cellular activity of engineered TfR-binding SGSH-Fc fusion proteins were assessed to demonstrate that SGSH maintains its enzymatic activity when fused to the human IgG fragment. The in vitro activity of recombinant SGSH was measured using a two-step fluorometric enzymatic assay using an artificial substrate. Specifically, 20 μL of 1 mM 4-Methylumbelliferyl 2-deoxy-2-sulfamino-a-D-glucopyranoside sodium salt substrate (Carbosynth Limited, # EM06602) diluted in the assay buffer (0.03 M sodium acetate, 0.12 M NaCl, pH 6.5) was mixed with 10 μL of 40 nM SGSH. The first reaction was incubated for 17 hr at 37° C. and then terminated with 10 μL of 0.2 M phosphate-citrate buffer, pH 6.7. Next, the second reaction was initiated by adding 10 μL (0.5 U) of yeast α-Glucosidase (Sigma, # G0660-750UN), incubated for 24 hr at 37° C., and stopped with the addition of 100 μL of 0.5 M sodium carbonate buffer, pH 10.3. Fluorescence of the reaction solution was then measured (excitation at 365 nm and emission at 450 nm). A 4-Methylumbelliferone standard curve was fit by linear regression to calculate the amount of product and verified as less than 10% of total substrate cleavage. Specific activity (fmol product/min/pmol SGSH) was calculated by dividing the amount of product by the reaction time and molar amount of SGSH.

The in vitro enzymatic activity assay demonstrated that SGSH-Fc fusion proteins were active and indicated that the fusion of an Fc region to SGSH does not interfere with enzymatic activity (FIG. 15).

SGSH knockout (KO) cells were generated using CRISPR/CAS9 to provide a cellular system to test the cellular activity of the engineered SGSH-Fc polypeptides. HEK 293T cells (ATCC) were transfected with CRISPR/CAS9 pCas-Guide-EF1a-GFP vector (Origene) containing guide sequences targeted to exon 2 upstream of the reactive cysteine site that generates the formylglycine in human SGSH. To identify SGSH KO cells, single cell clones were grown and cell lysates were subjected to the in vitro SGSH enzyme assay described above. Briefly, the in vitro activity assay was performed using 12.5, 25, 50 and 100 μs cell lysate in lead acetate assay buffer pH 5.0 (100 mM sodium acetate, 10 mM lead acetate). The reaction was started by combining 20 μL normalized cell lysate (in water) with 1 mM substrate in 10 IAL lead acetate buffer (3×) and incubated at 37° C. for seventeen hours. This first reaction was stopped by the addition of 70 μL 4× citrate phosphate buffer pH 6.7 plus 0.5 U NAGLU (Sigma). The reaction proceeded for 24 hours at 37° C. and was then stopped by the addition of 100 μL 0.5 M sodium carbonate pH 10.3. SGSH activity in HEK293T CRISPR clones was compared to recombinant SGSH (R&D) used as an assay standard, HEK wild-type (WT) lysates, and HEK cell lysates over-expressing SGSH. Clones with enzyme activity levels comparable to background signal were sequence verified after mini-Topo (ThermoFisher) cloning and confirmed as KO clones. Subsequent cell assays used three unique and verified SGSH KO clones and three independent batches of WT HEK293T cells.

To test the cellular activity of naked SGSH enzyme or SGSH-Fc fusion proteins, an LC-MS/MS-based glycomic assay was developed that allows monitoring of the amount of substrate accumulation (heparan sulfate) as an indicator of SGSH activity. Substrate accumulation was measured in SGSH KO cells and WT HEK293T cells. SGSH KO cells and WT HEK293T cells were cultured for 24 hours, and cells were washed three times with PBS, pelleted, and frozen. Cell pellets were sonicated in disaccharide digestion buffer (111 mM NH₄OAc, 11 mM CaOAc, pH 7.0). Protein concentration was measured using BCA assay (Pierce). Total protein (100 μg) was added to 100 μL digestion buffer with 2 mM DTT, 1.25 mIU Heparinase I (Galen), 1.25 mIU Heparinase II (Galen), and 1.25 mIU Heparinase III (Galen). Heparan sulfate digestion was complete after three hours at 30° C., after which 20 ng of internal standard (4UA-2S-G1cNCOEt-6S HD009 [Galen]) was added to each sample. The enzymes were deactivated by the addition of 6 μL of 250 mM EDTA, and samples were boiled at 95° C. for 10 minutes. Samples were then centrifuged 16,000×G for 5 minutes at room temperature. Supernatant was transferred to an Amicon Ultra 30 KD centrifugal filter (Millipore) and centrifuged at 14,000×G for 15 minutes. Disaccharides were concentrated in the flow through and were resuspended in a mixture of [1:1, v/v] assay buffer: acetonitrile which was then transferred to mass-spectrometry vials for further analysis.

GAG analyses were performed by liquid chromatography (Shimadzu Nexera X2 system, Shimadzu Scientific Instrument, Columbia, Md., USA) coupled to electrospray mass spectrometry (Sciex 6500+ QTRAP, Sciex, Framingham, Mass., USA). For each analysis, 10 μL of sample was injected on a ACQUITY UPLC BEH Amide 1.7 μm, 2.1×150 mm column (Waters Corporation, Milford, Mass., USA) using a flow rate of 0.4 mL/min with column temperature at 50° C. Mobile phase A consisted of water with 10 mM ammonium formate and 0.1% formic acid. Mobile phase B consisted of acetonitrile with 0.1% formic acid. The gradient was programmed as follows: 0.0-1.0 min at 85% B, 1.0-5.0 min from 85% B to 50% B, 5.0-6.0 min 50% B to 85% B, 6-8.0 min hold at 85% B. Electrospray ionization was performed in the negative-ion mode applying the following settings: curtain gas at 30; collision gas was set at medium; ion spray voltage at −4500; temperature at 450; ion source gas 1 at 50; ion source gas 2 at 60. Data acquisition was performed using Analyst 1.6.3 (Sciex) in multiple reaction monitoring mode (MRM), with dwell time 25 (msec). Collision energy at −30; declustering potential at −80; entrance potential at −10; collision cell exit potential at −10. GAGs were detected as [M−H]− using the following MRM transitions: D0A0 at m/z 378.1>87.0; D0a0 at m/z 378.1>175.0; D0S0 at m/z 416.1>138.0; D0a4 at m/z 458.1>300.0; D0A6,D2A0, D0a6, D2a0 at m/z 458.1>97.0; D0S6, D2S0 at m/z 496.0>416.1; D2a4, D2a6, D0a10, D2A6 at m/z 538.0>458.0; D0S6 at m/z 575.95>97.0 4UA-2S-G1cNCOEt-6S at m/z 472.0 (fragment ion) >97.0 was used as internal standard (I. S.). GAGs were identified based on their retention times and MRM transitions match to commercially available reference standards (Iduron Ltd, Manchester, UK). Quantification was performed using MultiQuant 3.0.2 (Sciex) by the area ratio to I.S. GAGs were normalized to total protein amount. Protein concentration was measured using BCA assay (Pierce).

Significant substrate accumulation, as reflected by the amount of disaccharides observed after digestion of heparan sulfate, was seen in SGSH KO cells compared to control cell lines (FIG. 16). Using the LC-MS/MS-based glycomic assay, it was established that treatment of cells with a TfR-binding SGSH-Fc fusion protein reduced the levels of heparan sulfate-derived disaccharides back to that seen in wild-type cells (FIG. 17). Together, these data demonstrate that SGSH-Fc fusion proteins maintain enzymatic activity and can reduce substrate accumulation in SGSH KO cells.

Example 8. In Vitro Assay for SGSH Activity

This example provides an alternative in vitro activity assay for SGSH-Fc fusion proteins. The assay is adapted from Karpova et al., J. Inherit. Metab. Dis., 19:278-285 (1996).

The standard reaction mixtures consisted of 10-15 μg of protein and 20 μL MU-α-GlcN5 (5 or 10 mmol/L, respectively) in Michaelis' barbital sodium acetate buffer, pH 6.5 (29 mmol/L sodium barbital, 29 mmol/L sodium acetate, 0.68% (w/v) NaCl, 0.02% (w/v) sodium azide; adjusted to pH 6.5 with HCl) and the reaction mixtures were incubated for 17 h at 37° C. MU-α-GlcNS is available from Moscerdam Substrates. After the first incubation, 6 μl twice-concentrated McIlvain's phosphate/citrate buffer, pH 6.7, containing 0.02% sodium azide and 10 μl (0.1 U) yeast a-glucosidase (Sigma) in water were added and a second incubation of 24 h at 37° C. was carried out. Long incubations at 37° C. (17-24 h) were carried out in 96-well plates which were sealed airtight with broad sticky tape, limiting evaporation to <15%. Next, 200 μL 0.5 mol/L Na₂CO₃/NaHCO₃, pH 10.7, was added, and the fluorescence of the released 4-methylumbelliferone (MU) was measured on a Fluoroskan (Titertek) fluorimeter. Protein was determined as described previously (van Diggelen et al., Clin. Chim. Acta., 187:131-139 (1990)).

Example 9. Modified Fc Polypeptides that Bind to TfR

This example describes modifications to Fc polypeptides to confer transferrin receptor (TfR) binding and transport across the blood-brain barrier (BBB).

Unless otherwise indicated, the positions of amino acid residues in this section are numbered based on EU index numbering for a human IgG1 wild-type Fc region.

Generation and Characterization of Fc Polypeptides Comprising Modifications at Positions 384, 386, 387, 388, 389, 390, 413, 416, and 421 (CH3C clones)

Yeast libraries containing Fc regions having modifications introduced into positions including amino acid positions 384, 386, 387, 388, 389, 390, 413, 416, and 421 were generated as described below. Illustrative clones that bind to TfR are shown in Tables 4 and 5.

After an additional two rounds of sorting, single clones were sequenced and four unique sequences were identified. These sequences had a conserved Trp at position 388, and all had an aromatic residue (i.e., Trp, Tyr, or His) at position 421. There was a great deal of diversity at other positions.

The four clones selected from the library were expressed as Fc fusions to Fab fragments in CHO or 293 cells, and purified by Protein A and size-exclusion chromatography, and then screened for binding to human TfR in the presence or absence of holo-Tf by ELISA. The clones all bound to human TfR and the binding was not affected by the addition of excess (5 μM) holo-Tf. Clones were also tested for binding to 293F cells, which endogenously express human TfR. The clones bound to 293F cells, although the overall binding was substantially weaker than the high-affinity positive control.

Next, it was tested whether clones could internalize in TfR-expressing cells using clone CH3C.3 as a test clone. Adherent HEK 293 cells were grown in 96-well plates to about 80% confluence, media was removed, and samples were added at 1 μM concentrations: clone CH3C.3, anti-TfR benchmark positive control antibody (Ab204), anti-BACE1 benchmark negative control antibody (Ab107), and human IgG isotype control (obtained from Jackson Immunoresearch). The cells were incubated at 37° C. and 8% CO₂ concentration for 30 minutes, then washed, permeabilized with 0.1% Triton™ X-100, and stained with anti-human-IgG-Alexa Fluor® 488 secondary antibody. After additional washing, the cells were imaged under a high content fluorescence microscope (i.e., an Opera Phenix™ system), and the number of puncta per cell was quantified. At 1 μM, clone CH3C.3 showed a similar propensity for internalization to the positive anti-TfR control, while the negative controls showed no internalization.

Further Engineering of Clones

Additional libraries were generated to improve the affinity of the initial hits against human TfR using a soft randomization approach, wherein DNA oligos were generated to introduce soft mutagenesis based on each of the original four hits. Additional clones were identified that bound TfR and were selected. The selected clones fell into two general sequence groups. Group 1 clones (i.e., clones CH3C.18, CH3C.21, CH3C.25, and CH3C.34) had a semi-conserved Leu at position 384, a Leu or His at position 386, a conserved and a semi-conserved Val at positions 387 and 389, respectively, and a semi-conserved P-T-W motif at positions 413, 416, and 421, respectively. Group 2 clones had a conserved Tyr at position 384, the motif TXWSX at positions 386-390, and the conserved motif S/T-E-F at positions 413, 416, and 421, respectively. Clones CH3C.18 and CH3C.35 were used in additional studies as representative members of each sequence group.

Epitope Mapping

To determine whether the engineered Fc regions bound to the apical domain of TfR, TfR apical domain was expressed on the surface of phage. To properly fold and display the apical domain, one of the loops had to be truncated and the sequence needed to be circularly permuted. Clones CH3C.18 and CH3C.35 were coated on ELISA plates and a phage ELISA protocol was followed. Briefly, after washing and blocking with 1% PBSA, dilutions of phage displaying were added and incubated at room temperature for 1 hour. The plates were subsequently washed and anti-M13-HRP was added, and after additional washing the plates were developed with TMB substrate and quenched with 2N H₂SO₄. Both clones CH3C.18 and CH3C.35 bound to the apical domain in this assay.

Paratope Mapping

To understand which residues in the Fc domain were most important for TfR binding, a series of mutant clone CH3C.18 and clone CH3C.35 Fc regions was created in which each mutant had a single position in the TfR binding register mutated back to wild-type. The resulting variants were expressed recombinantly as Fc-Fab fusions and tested for binding to human or cyno TfR. For clone CH3C.35, positions 388 and 421 were important for binding; reversion of either of these to wild-type completely ablated binding to human TfR.

Binding Characterization of Maturation Clones

Binding ELISAs were conducted with purified Fc-Fab fusion variants with human or cyno TfR coated on the plate, as described above. The variants from the clone CH3C.18 maturation library, clone CH3C.3.2-1, clone CH3C.3.2-5, and clone CH3C.3.2-19, bound human and cyno TfR with approximately equivalent EC50 values, whereas the parent clones CH3C.18 and CH3C.35 had greater than 10-fold better binding to human versus cyno TfR.

Next, it was tested whether the modified Fc polypeptides internalized in human and monkey cells. Using the protocol described above, internalization in human HEK 293 cells and rhesus LLC-MK2 cells was tested. The variants that similarly bound human and cyno TfR, clones CH3C.3.2-5 and CH3C.3.2-19, had significantly improved internalization in LLC-MK2 cells as compared with clone CH3C.35.

Additional Engineering of Clones

Additional engineering to further affinity mature clones CH3C.18 and CH3C.35 involved adding additional mutations to the positions that enhanced binding through direct interactions, second-shell interactions, or structure stabilization. This was achieved via generation and selection from an “NNK walk” or “NNK patch” library. The NNK walk library involved making one-by-one NNK mutations of residues that are near to the paratope. By looking at the structure of Fc bound to FcγRI (PDB ID: 4W40), 44 residues near the original modification positions were identified as candidates for interrogation. Specifically, the following residues were targeted for NNK mutagenesis: K248, R255, Q342, R344, E345, Q347, T359, K360, N361, Q362, S364, K370, E380, E382, S383, G385, Y391, K392, T393, D399, S400, D401, S403, K409, L410, T411, V412, K414, S415, Q418, Q419, G420, V422, F423, S424, S426, Q438, S440, S442, L443, S444, P4458, G446, and K447. The 44 single point NNK libraries were generated using Kunkel mutagenesis, and the products were pooled and introduced to yeast via electroporation, as described above for other yeast libraries.

The combination of these mini-libraries (each of which had one position mutated, resulting in 20 variants) generated a small library that was selected using yeast surface display for any positions that lead to higher affinity binding. Selections were performed as described above, using TfR apical domain proteins. After three rounds of sorting, clones from the enriched yeast library were sequenced, and several “hot-spot” positions were identified where certain point mutations significantly improved the binding to apical domain proteins. For clone CH3C.35, these mutations included E380 (mutated to Trp, Tyr, Leu, or Gln) and S415 (mutated to Glu). The sequences of the clone CH3C.35 single and combination mutants are set forth in SEQ ID NOS:27-38. For clone CH3C.18, these mutations included E380 (mutated to Trp, Tyr, or Leu) and K392 (mutated to Gln, Phe, or His). The sequences of the clone CH3C.18 single mutants are set forth in SEQ ID NOS:21-26.

Additional Maturation Libraries to Improve Clone CH3C.35 Affinity

An additional library to identify combinations of mutations from the NNK walk library, while adding several additional positions on the periphery of these, was generated as described for previous yeast libraries. In this library, the YxTEWSS (SEQ ID NO:242) and TxxExxxxF motifs were kept constant, and six positions were completely randomized: E380, K392, K414, S415, S424, and S426. Positions E380 and S415 were included because they were “hot spots” in the NNK walk library. Positions K392, S424, and S426 were included because they make up part of the core that may position the binding region, while K414 was selected due to its adjacency to position 415.

This library was sorted, as previously described, with the cyno TfR apical domain only. The enriched pool was sequenced after five rounds, and the sequences of the modified regions of the identified unique clones are set forth in SEQ ID NOS:42-59.

The next libraries were designed to further explore acceptable diversity in the main binding paratope. Each of the original positions (384, 386, 387, 388, 389, 390, 413, 416, and 421) plus the two hot spots (380 and 415) were individually randomized with NNK codons to generate a series of single-position saturation mutagenesis libraries on yeast. In addition, each position was individually reverted to the wild-type residue, and these individual clones were displayed on yeast. It was noted that positions 380, 389, 390, and 415 were the only positions that retained substantial binding to TfR upon reversion to the wild-type residue (some residual but greatly diminished binding was observed for reversion of 413 to wild-type).

The single-position NNK libraries were sorted for three rounds against the human TfR apical domain to collect the top ˜5% of binders, and then at least 16 clones were sequenced from each library. The results indicate what amino acids at each position can be tolerated without significantly reducing binding to human TfR, in the context of clone CH3C.35. A summary is below:

Position 380: Trp, Leu, or Glu;

Position 384: Tyr or Phe;

Position 386: Thr only;

Position 387: Glu only;

Position 388: Trp only;

Position 389: Ser, Ala, or Val (although the wild type Asn residue seems to retain some binding, it did not appear following library sorting);

Position 390: Ser or Asn;

Position 413: Thr or Ser;

Position 415: Glu or Ser;

Position 416: Glu only; and

Position 421: Phe only.

The above residues, when substituted into clone CH3C.35 as single changes or in combinations, represent paratope diversity that retains binding to TfR apical domain. Clones having mutations at these positions include those shown in Table 5, and the sequences of the CH3 domains of these clones are set forth in SEQ ID NOS:34-38, 58, and 60-90.

Example 10. Additional Fc Positions that can be Modified to Confer TfR Binding

Additional modified Fc polypeptides that bind to transferrin receptor (TfR) were generated having modifications at alternative sites in the Fc region, e.g., at the following positions:

positions 274, 276, 283, 285, 286, 287, 288, and 290 (CH2A2 clones);

positions 266, 267, 268, 269, 270, 271, 295, 297, 298, and 299 (CH2C clones);

positions 268, 269, 270, 271, 272, 292, 293, 294, and 300 (CH2D clones);

positions 272, 274, 276, 322, 324, 326, 329, 330, and 331 (CH2E3 clones); or positions 345, 346, 347, 349, 437, 438, 439, and 440 (CH3B clones).

Illustrative CH3B clones that bind to TfR are set forth in SEQ ID NOS:124-128. Illustrative CH2A2 clones that bind to TfR are set forth in SEQ ID NOS:129-133. Illustrative CH2C clones that bind to TfR are set forth in SEQ ID NOS:134-138. Illustrative CH2D clones that bind to TfR are set forth in SEQ ID NOS:139-143. Illustrative CH2E3 clones that bind to TfR are set forth in SEQ ID NOS:144-148.

Example 11. Methods

Generation of Phage-Display Libraries

A DNA template coding for the wild-type human Fc sequence was synthesized and incorporated into a phagemid vector. The phagemid vector contained an ompA or pelB leader sequence, the Fc insert fused to c-Myc and 6×His (SEQ ID NO:241) epitope tags, and an amber stop codon followed by M13 coat protein pIII.

Primers containing “NNK” tricodons at the desired positions for modifications were generated, where N is any DNA base (i.e., A, C, G, or T) and K is either G or T. Alternatively, primers for “soft” randomization were used, where a mix of bases corresponding to 70% wild-type base and 10% of each of the other three bases was used for each randomization position. Libraries were generated by performing PCR amplification of fragments of the Fc region corresponding to regions of randomization and then assembled using end primers containing SfiI restriction sites, then digested with SfiI and ligated into the phagemid vectors. Alternatively, the primers were used to conduct Kunkel mutagenesis. The ligated products or Kunkel products were transformed into electrocompetent E. coli cells of strain TG1 (obtained from Lucigen®). The E. coli cells were infected with M13K07 helper phage after recovery and grown overnight, after which library phage were precipitated with 5% PEG/NaCl, resuspended in 15% glycerol in PBS, and frozen until use. Typical library sizes ranged from about 10⁹ to about 10¹¹ transformants. Fc-dimers were displayed on phage via pairing between pIII-fused Fc and soluble Fc not attached to pIII (the latter being generated due to the amber stop codon before pIII).

Generation of Yeast-Display Libraries

A DNA template coding for the wild-type human Fc sequence was synthesized and incorporated into a yeast display vector. For CH2 and CH3 libraries, the Fc polypeptides were displayed on the Aga2p cell wall protein. Both vectors contained prepro leader peptides with a Kex2 cleavage sequence, and a c-Myc epitope tag fused to the terminus of the Fc.

Yeast display libraries were assembled using methods similar to those described for the phage libraries, except that amplification of fragments was performed with primers containing homologous ends for the vector. Freshly prepared electrocompetent yeast (i.e., strain EBY100) were electroporated with linearized vector and assembled library inserts. Electroporation methods will be known to one of skill in the art. After recovery in selective SD-CAA media, the yeast were grown to confluence and split twice, then induced for protein expression by transferring to SG-CAA media. Typical library sizes ranged from about 10⁷ to about 10⁹ transformants. Fc-dimers were formed by pairing of adjacently displayed Fc monomers.

General Methods for Phage Selection

Phage methods were adapted from Phage Display: A Laboratory Manual (Barbas, 2001). Additional protocol details can be obtained from this reference.

Plate Sorting Methods

Antigen was coated on MaxiSorp® microtiter plates (typically 1-10 μg/mL) overnight at 4° C. The phage libraries were added into each well and incubated overnight for binding. Microtiter wells were washed extensively with PBS containing 0.05% Tween® 20 (PBST) and bound phage were eluted by incubating the wells with acid (typically 50 mM HCl with 500 mM KCl, or 100 mM glycine, pH 2.7) for 30 minutes. Eluted phage were neutralized with 1 M Tris (pH 8) and amplified using TG1 cells and M13/KO7 helper phage and grown overnight at 37° C. in 2YT media containing 50 μg/mL carbenacillin and 50 ug/mL Kanamycin. The titers of phage eluted from a target-containing well were compared to titers of phage recovered from a non-target-containing well to assess enrichment. Selection stringency was increased by subsequently decreasing the incubation time during binding and increasing washing time and number of washes.

Bead Sorting Methods

Antigen was biotinylated through free amines using NHS-PEG4-Biotin (obtained from Pierce™). For biotinylation reactions, a 3- to 5-fold molar excess of biotin reagent was used in PBS. Reactions were quenched with Tris followed by extensive dialysis in PBS. The biotinylated antigen was immobilized on streptavidin-coated magnetic beads, (i.e., M280-streptavidin beads obtained Thermo Fisher). The phage display libraries were incubated with the antigen-coated beads at room temperature for 1 hour. The unbound phage were then removed and beads were washed with PBST. The bound phage were eluted by incubating with 50 mM HCl containing 500 mM KCl (or 0.1 M glycine, pH 2.7) for 30 minutes, and then neutralized and propagated as described above for plate sorting.

After three to five rounds of panning, single clones were screened by either expressing Fc on phage or solubly in the E. coli periplasm. Such expression methods will be known to one of skill in the art. Individual phage supernatants or periplasmic extracts were exposed to blocked ELISA plates coated with antigen or a negative control and were subsequently detected using HRP-conjugated goat anti-Fc (obtained from Jackson Immunoresearch) for periplasmic extracts or anti-M13 (GE Healthcare) for phage, and then developed with TMB reagent (obtained from Thermo Fisher). Wells with OD450 values greater than around 5-fold over background were considered positive clones and sequenced, after which some clones were expressed either as a soluble Fc fragment or fused to Fab fragments

General Methods for Yeast Selection

Bead Sorting (Magnetic-Assisted Cell Sorting (MACS)) Methods

MACS and FACS selections were performed similarly to as described in Ackerman et al., Biotechnol. Prog., 25(3):774 (2009). Streptavidin magnetic beads (e.g., M-280 streptavidin beads from ThermoFisher) were labeled with biotinylated antigen and incubated with yeast (typically 5-10× library diversity). Unbound yeast were removed, the beads were washed, and bound yeast were grown in selective media and induced for subsequent rounds of selection.

Fluorescence-Activated Cell Sorting (FACS) Methods

Yeast were labeled with anti-c-Myc antibody to monitor expression and biotinylated antigen (concentration varied depending on the sorting round). In some experiments, the antigen was pre-mixed with streptavidin-Alexa Fluor® 647 in order to enhance the avidity of the interaction. In other experiments, the biotinylated antigen was detected after binding and washing with streptavidin-Alexa Fluor® 647. Singlet yeast with binding were sorted using a FACS Aria III cell sorter. The sorted yeast were grown in selective media then induced for subsequent selection rounds.

After an enriched yeast population was achieved, yeast were plated on SD-CAA agar plates and single colonies were grown and induced for expression, then labeled as described above to determine their propensity to bind to the target. Positive single clones were subsequently sequenced for binding antigen, after which some clones were expressed either as a soluble Fc fragment or as fused to Fab fragments.

General Methods for Screening

Screening by ELISA

Clones were selected from panning outputs and grown in individual wells of 96-well deep-well plates. The clones were either induced for periplasmic expression using autoinduction media (obtained from EMD Millipore) or infected with helper phage for phage-display of the individual Fc variants on phage. ELISA plates were coated with antigen, typically at 0.5 mg/mL overnight, then blocked with 1% BSA before addition of phage or periplasmic extracts. After a 1-hour incubation and washing off unbound protein, HRP-conjugated secondary antibody was added (i.e., anti-Fc or anti-M13 for soluble Fc or phage-displayed Fc, respectively) and incubated for 30 minutes. The plates were washed again, and then developed with TMB reagent and quenched with 2N sulfuric acid. Absorbance at 450 nm was quantified using a plate reader (BioTek®) and binding curves were plotted using Prism software where applicable. In some assays, soluble transferrin or other competitor was added during the binding step, typically at significant molar excess.

Screening by Flow Cytometry

Fc variant polypeptides (expressed either on phage, in periplasmic extracts, or solubly as fusions to Fab fragments) were added to cells in 96-well V-bottom plates (about 100,000 cells per well in PBS+1% BSA (PBSA)), and incubated at 4° C. for 1 hour. The plates were subsequently spun and the media was removed, and then the cells were washed once with PBSA. The cells were resuspended in PBSA containing secondary antibody (typically goat anti-human-IgG-Alexa Fluor® 647 (obtained from Thermo Fisher)). After 30 minutes, the plates were spun and the media was removed, the cells were washed 1-2 times with PBSA, and then the plates were read on a flow cytometer (i.e., a FACSCanto™ II flow cytometer). Median fluorescence values were calculated for each condition using FlowJo software and binding curves were plotted with Prism software.

Example 12. Selection of TfR-Binding Polypeptide Affinity

This example describes the relationship between the affinity of a TfR-binding polypeptide for a transferrin receptor (TfR) and the resulting brain exposure to a therapeutic agent that is linked to the TfR-binding polypeptide.

FIG. 18 shows that brain exposure to a therapeutic agent (as assessed by determining the area under the curve (AUC) of brain concentration vs. time) was shortened when the therapeutic agent was linked to a polypeptide that had a relatively stronger affinity for TfR. In particular, brain exposure was substantially shortened when the therapeutic agent was linked to a polypeptide that had an affinity for TfR that was stronger than about 250 nM.

As shown in FIG. 19, higher maximum concentration (C_(max)) in the brain was observed when a therapeutic agent was linked to a polypeptide that had a relatively stronger affinity for TfR. In particular, brain C_(max) values were significantly higher when the TfR-binding polypeptide had an affinity that was stronger than about 250 nM.

FIG. 20 shows the ratio of brain C_(max) to plasma concentration of a therapeutic agent when linked to polypeptides having a range of affinities for TfR.

Methods

Generation of TfR^(ms/hu) KI

Methods for generating knock-in/knock-out mice have been published in the literature and are well known to those with skill in the art. In summary, TfR^(ms/hu) KI mice were generated using CRISPR/Cas9 technology to express human Tfrc apical domain within the murine Tfrc gene; the resulting chimeric TfR was expressed in vivo under the control of the endogenous promoter. As described in International Patent Publication No. WO 2018/152285, which is incorporated by reference in its entirety herein, C57B16 mice were used to generate a knock-in of the human apical TfR mouse line via pronuclear microinjection into single cell embryos, followed by embryo transfer to pseudo pregnant females. Specifically, Cas9, single guide RNAs and a donor DNA were introduced into the embryos. The donor DNA comprised a human apical domain coding sequence that had been codon optimized for expression in mouse. The apical domain coding sequence was flanked with a left and a right homology arm. The donor sequence was designed such that the apical domain was inserted after the fourth mouse exon, and was immediately flanked at the 3′ end by the ninth mouse exon. A founder male from the progeny of the female that received the embryos was bred to wild-type females to generate F1 heterozygous mice. Homozygous mice were subsequently generated from breeding of F1 generation heterozygous mice.

Mouse PK/PD

For PK/PD evaluation, TfR^(ms/hu) KI mice were systemically dosed one time via tail vein injection at 50 mg/kg. Prior to perfusion, blood was collected in EDTA plasma tubes via cardiac puncture and spun at 14,000 rpm for 5 minutes. Plasma was then isolated for subsequent PK/PD analysis. Brains were extracted after perfusion and hemi-brains were isolated for homogenization in 10× by tissue weight of 1% NP-40 in PBS (for PK) or 5 M GuHCl (for PD).

Engineered TfR-binding polypeptide concentrations in mouse plasma and brain lysates were quantified using a generic human IgG assay (MSD human IgG kit # K150JLD) following the manufacturer's instructions. Briefly, pre-coated plates were blocked for 30 minutes with MSD Blocker A. Plasma samples were diluted 1:10,000 using a Hamilton Nimbus liquid handler and added in duplicate to the blocked plates. Brain samples were homogenized in 1% NP-40 lysis buffer and lysates diluted 1:10 for PK analysis. Dosing solutions were also analyzed on the same plate to confirm the correct dosage. The standard curve, 0.78-200 ng/mL IgG, was fit using a four-parameter logistic regression.

Example 13. Binding Characterization of CH3C Variants Using Biacore™

The affinity of clone variants for recombinant TfR apical domain was determined by surface plasmon resonance using a Biacore™ T200 instrument. Biacore™ Series S CM5 sensor chips were immobilized with anti-human Fab (human Fab capture kit from GE Healthcare). 5 μg/mL of polypeptide-Fab fusion was captured for 1 minute on each flow cell and serial 3-fold dilutions of human or cyno apical domain were injected at a flow rate of 30 μL/min at room temperature. Each sample was analyzed with a 45-second association and a 3-minute dissociation. After each injection, the chip was regenerated using 10 mM glycine-HCl (pH 2.1). Binding response was corrected by subtracting the RU from a flow cell capturing an irrelevant IgG at similar density. Steady-state affinities were obtained by fitting the response at equilibrium against the concentration using Biacore™ T200 Evaluation Software v3.1.

To determine the affinity of clone variants for recombinant TfR ectodomain (ECD), Biacore™ Series S CM5 sensor chips were immobilized with streptavidin. Biotinylated human or cyno TfR ECD was captured for 1 minute on each flow cell and serial 3-fold dilutions of clone variants were injected at a flow rate of 30 μL/min at room temperature. Each sample was analyzed with a 45-second association and a 3-minute dissociation. The binding response was corrected by subtracting the RU from a flow cell without TfR ECD at a similar density. Steady-state affinities were obtained by fitting the response at equilibrium against the concentration using Biacore™ T200 Evaluation Software v3.1.

The binding affinities are summarized in Table 6. Affinities were obtained by steady-state fitting.

TABLE 6 Binding affinities for exemplary CH3C variants. Human Cyno Human Cyno apical apical Clone TfR (μM) TfR (μM) TfR (μM) TfR (μM) CH3C.35.19.mono 0.4 5.9 0.37 5.6 CH3C.35.20.mono 0.25 6.7 0.17 8 CH3C.35.21.mono 0.1 2.1 0.12 2.2 CH3C.35.24.mono 0.29 3.3 0.23 3 CH3C.35.21.11.mono 0.24 4 0.13 2.2 CH3C.35.21.16.mono 0.18 1.8 0.12 1.9 CH3C.35.21.17.mono 0.3 2.9 0.13 2.6 CH3C.35.mono 0.61 >10 0.61 >10 CH3C.35.N153.mono 0.42 >10 0.95 >10 CH3C.35.bi 0.22 >2 not tested not tested CH3C.35.N153.bi 0.37 3.3 not tested not tested CH3C.3.2-19.bi 5.2 5.6 not tested not tested CH3C.35.19.bi 0.074 1.5 not tested not tested CH3C.35.20.bi 0.054 1.7 not tested not tested CH3C.35.21.bi 0.049 0.7 not tested not tested CH3C.35.24.bi 0.061 0.65 not tested not tested

Example 14. Brain and Plasma PKPD of Polypeptide-Fab Fusions in TfR^(ms/hu) Mice: CH3C.35.21, CH3C.35.20, CH3C.35, CH3C.35.23, CH3C.35.23.3

To evaluate the impact of TfR binding affinity for PK and brain uptake, anti-BACE1 Ab153 and TfR-binding polypeptide fusions (CH3C.35.21:Ab153, CH3C.35.20:Ab153, CH3C.35:Ab153 fusions) were generated that differed in their binding affinity to apical human TfR as measured by Biacore™. The binding affinities of CH3C.35.21:Ab153, CH3C.35.20:Ab153, CH3C.35:Ab153 fusions to human TfR are 100 nM, 170 nM and 620 nM, respectively. TfR^(ms/hu) knock-in mice were systemically administered either Ab153 or the polypeptide-Fab fusions at 50 mg/kg, and plasma PK and brain PKPD was evaluated at 1, 3, and 7 days post-dose. Brain and plasma PKPD analysis was conducted as described above. Due to expression of TfR on peripheral tissues, CH3C.35.21:Ab153, CH3C.35.20:Ab153, and CH3C.35:Ab153 fusions exhibited faster clearance in plasma as compared to Ab153 alone, consistent with target-mediated clearance and indicative of in vivo TfR binding (FIG. 21A). Impressively, brain concentrations of CH3C.35.21:Ab153, CH3C.35.20:Ab153, and CH3C.35:Ab153 fusions were significantly increased compared to Ab153, achieving a maximum brain concentration of more than 30 nM at 1 day post-dose, compared to only about 3 nM for Ab153 at this same time point (FIG. 21B). The increase in brain exposure of CH3C.35.21:Ab153, CH3C.35.20:Ab153, and CH3C.35:Ab153 fusions resulted in about 55-60% lower endogenous mouse Aβ levels in brains of mice compared to Aβ levels in mice dosed with Ab153 (FIG. 21C). The lower brain Aβ levels were sustained while concentrations of CH3C.35.21:Ab153, CH3C.35.20:Ab153, and CH3C.35:Ab153 fusions remained elevated in brain, and returned to levels similar to Ab153 treated mice at when exposure was reduced by day 7. The reduction in brain exposure over time correlated with a reduction in peripheral exposure of CH3C.35.21:Ab153, CH3C.35.20:Ab153, and CH3C.35:Ab153 fusions, providing a clear PK/PD relationship in vivo (compare FIGS. 21A and 21C). Additionally, total brain TfR levels were comparable for Ab153-treated and polypeptide-Fab fusion-treated mice after this single high dose, indicating no significant impact of increased brain exposure of the polypeptide-Fab fusions to TfR expression in brain (FIG. 21D).

The amino acid substitutions for each clone described in the Tables (e.g., Table 5) dictate the amino acid substitutions at the register positions of that clone over the amino acids found in the sequence set forth in the Sequence Listing, in case of discrepancy.

It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. The sequences of the sequence accession numbers cited herein are hereby incorporated by reference.

TABLE 4 CH3 Domain Modifications. Clone name Group 384 385 386 387 388 389 390 391 . . . 413 414 415 416 417 418 419 420 421 Wild-type n/a N G Q P E N N Y . . . D K S R W Q Q G N  1 L G L V W V G Y . . . A K S T W Q Q G W  2 Y G T V W S H Y . . . S K S E W Q Q G Y  3 Y G T E W S Q Y . . . E K S D W Q Q G H  4 V G T P W A L Y . . . L K S E W Q Q G W 17 2 Y G T V W S K Y . . . S K S E W Q Q G F 18 1 L G H V W A V Y . . . P K S T W Q Q G W 21 1 L G L V W V G Y . . . P K S T W Q Q G W 25 1 M G H V W V G Y . . . D K S T W Q Q G W 34 1 L G L V W V F S . . . P K S T W Q Q G W 35 2 Y G T E W S S Y . . . T K S E W Q Q G F 44 2 Y G T E W S N Y . . . S K S E W Q Q G F 51 1/2 L G H V W V G Y . . . S K S E W Q Q G W  3.1-3 1 L G H V W V A T . . . P K S T W Q Q G W  3.1-9 1 L G P V W V H T . . . P K S T W Q Q G W  3.2-5 1 L G H V W V D Q . . . P K S T W Q Q G W  3.2-19 1 L G H V W V N Q . . . P K S T W Q Q G W  3.2-1 1 L G H V W V N F . . . P K S T W Q Q G W

TABLE 5 Additional CH3 Domain Modifications. Clone name 378 379 380 381 382 383 384 385 386 387 388 389 390 391 Wild-type A V E W E S N G Q P E N N Y 35.20.1 . . . . . . F . T E W S S . 35.20.2 . . . . . . Y . T E W A S . 35.20.3 . . . . . . Y . T E W V S . 35.20.4 . . . . . . Y . T E W S S . 35.20.5 . . . . . . F . T E W A S . 35.20.6 . . . . . . F . T E W V S . 35.21.a.1 . . W . . . F . T E W S S . 35.21.a.2 . . W . . . Y . T E W A S . 35.21.a.3 . . W . . . Y . T E W V S . 35.21.a.4 . . W . . . Y . T E W S S . 35.21.a.5 . . W . . . F . T E W A S . 35.21.a.6 . . W . . . F . T E W V S . 35.23.1 . . . . . . F . T E W S . . 35.23.2 . . . . . . Y . T E W A . . 35.23.3 . . . . . . Y . T E W V . . 35.23.4 . . . . . . Y . T E W S . . 35.23.5 . . . . . . F . T E W A . . 35.23.6 . . . . . . F . T E W V . . 35.24.1 . . W . . . F . T E W S . . 35.24.2 . . W . . . Y . T E W A . . 35.24.3 . . W . . . Y . T E W V . . 35.24.4 . . W . . . Y . T E W S . . 35.24.5 . . W . . . F . T E W A . . 35.24.6 . . W . . . F . T E W V . . 35.21.17.1 . . L . . . F . T E W S S . 35.21.17.2 . . L . . . Y . T E W A S . 35.21.17.3 . . L . . . Y . T E W V S . 35.21.17.4 . . L . . . Y . T E W S S . 35.21.17.5 . . L . . . F . T E W A S . 35.21.17.6 . . L . . . F . T E W V S . 35.20 . . . . . . Y . T E W S S . 35.21 . . W . . . Y . T E W S S . 35.22 . . W . . . Y . T E W S . . 35.23 . . . . . . Y . T E W S . . 35.24 . . W . . . Y . T E W S . . 35.21.17 . . L . . . Y . T E W S S . 35.N390 . . . . . . Y . T E W S . . Clone name 392 411 412 413 414 415 416 417 418 419 420 421 422 423 Wild-type K T V D K S R W Q Q G N V F 35.20.1 . . . T . E E . . . . F . . 35.20.2 . . . T . E E . . . . F . . 35.20.3 . . . T . E E . . . . F . . 35.20.4 . . . S . E E . . . . F . . 35.20.5 . . . T . E E . . . . F . . 35.20.6 . . . T . E E . . . . F . . 35.21.a.1 . . . T . E E . . . . F . . 35.21.a.2 . . . T . E E . . . . F . . 35.21.a.3 . . . T . E E . . . . F . . 35.21.a.4 . . . S . E E . . . . F . . 35.21.a.5 . . . T . E E . . . . F . . 35.21.a.6 . . . T . E E . . . . F . . 35.23.1 . . . T . E E . . . . F . . 35.23.2 . . . T . E E . . . . F . . 35.23.3 . . . T . E E . . . . F . . 35.23.4 . . . S . E E . . . . F . . 35.23.5 . . . T . E E . . . . F . . 35.23.6 . . . T . E E . . . . F . . 35.24.1 . . . T . E E . . . . F . . 35.24.2 . . . T . E E . . . . F . . 35.24.3 . . . T . E E . . . . F . . 35.24.4 . . . S . E E . . . . F . . 35.24.5 . . . T . E E . . . . F . . 35.24.6 . . . T . E E . . . . F . . 35.21.17.1 . . . T . E E . . . . F . . 35.21.17.2 . . . T . E E . . . . F . . 35.21.17.3 . . . T . E E . . . . F . . 35.21.17.4 . . . S . E E . . . . F . . 35.21.17.5 . . . T . E E . . . . F . . 35.21.17.6 . . . T . E E . . . . F . . 35.20 . . . T . E E . . . . F . . 35.21 . . . T . E E . . . . F . . 35.22 . . . T . . E . . . . F . . 35.23 . . . T . E E . . . . F . . 35.24 . . . T . E E . . . . F . . 35.21.17 . . . T . E E . . . . F . . 35.N390 . . . T . . E . . . . F . .

INFORMAL SEQUENCE LISTING SEQ ID NO: Sequence Description 1 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Wild-type human Fc WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY sequence KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT positions 231-447 EU CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV index numbering DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 2 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN CH2 domain sequence WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY positions 231-340 EU KCKVSNKALPAPIEKTISKAK index numbering 3 GQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ CH3 domain sequence PENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEA Positions 341-447 EU LHNHYTQKSLSLSPGK index numbering 4 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGLVWVGYKTTPPVLDSDGSFFLYSKLTV AKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 5 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTVWSHYKTTPPVLDSDGSFFLYSKLTV SKSEWQQGYVFSCSVMHEALHNHYTQKSLSLSPGK 6 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSQYKTTPPVLDSDGSFFLYSKLTV EKSDWQQGHVFSCSVMHEALHNHYTQKSLSLSPGK 7 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESVGTPWALYKTTPPVLDSDGSFFLYSKLTV LKSEWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 8 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.17 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTVWSKYKTTPPVLDSDGSFFLYSKLTV SKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 9 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWAVYKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 10 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.21 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGLVWVGYKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 11 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.25 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESMGHVWVGYKTTPPVLDSDGSFFLYSKLT VDKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 12 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.34 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGLVWVFSKTTPPVLDSDGSFFLYSKLTVP KSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 13 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 14 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.44 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV SKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 15 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.51 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVGYKTTPPVLDSDGSFFLYSKLTV SKSEWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 16 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3.1-3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVATKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 17 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3.1-9 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGPVWVHTKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 18 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3.2-5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVDQKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 19 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3.2-19 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVNQKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 20 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.3.2-1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVNFKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 21 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESLGHVWAVYKTTPPVLDSDGSFFLYSKLT VPKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 22 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESLGHVWAVYKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 23 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVYWESLGHVWAVYKTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 24 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWAVYQTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 25 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWAVYFTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 26 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.18 variant WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWAVYHTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 27 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.13 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESLGHVWAVYKTTPPVLDSDGSFFLYSKLT VPKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 28 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.14 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWAVYQTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 29 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.15 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESLGHVWAVYQTTPPVLDSDGSFFLYSKLT VPKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 30 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.16 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESLGHVWVNQKTTPPVLDSDGSFFLYSKLT VPKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 31 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.17 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESLGHVWVNQQTTPPVLDSDGSFFLYSKLTV PKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 32 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.18 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESLGHVWVNQQTTPPVLDSDGSFFLYSKLT VPKSTWQQGWVFSCSVMHEALHNHYTQKSLSLSPGK 33 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.19 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 34 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 35 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 36 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.22 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 37 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 38 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 39 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.N163 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 40 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.K165Q WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSSYQTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 41 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.N163. WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY K165Q KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYQTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 42 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 43 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT KSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 44 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 45 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT GEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 46 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFSCWVMHEALHNHYTQKSLSLSPGK 47 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT KEEWQQGFVFSCWVMHEALHNHYTQKSLSLSPGK 48 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.7 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFTCWVMHEALHNHYTQKSLSLSPGK 49 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.8 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFTCGVMHEALHNHYTQKSLSLSPGK 50 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.9 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFECWVMHEALHNHYTQKSLSLSPGK 51 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.10 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT REEWQQGFVFKCWVMHEALHNHYTQKSLSLSPGK 52 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.11 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT PEEWQQGFVFKCWVMHEALHNHYTQKSLSLSPGK 53 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.12 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTV TREEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 54 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.13 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTV TGEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 55 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.14 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTV TREEWQQGFVFTCWVMHEALHNHYTQKSLSLSPGK 56 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.15 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTV TGEEWQQGFVFTCWVMHEALHNHYTQKSLSLSPGK 57 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.16 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTV TREEWQQGFVFTCGVMHEALHNHYTQKSLSLSPGK 58 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 59 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.18 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYRTTPPVLDSDGSFFLYSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 60 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 61 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 62 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 63 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTVS KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 64 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 65 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 66 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWSSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 67 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 68 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 69 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 70 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 71 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.a.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 72 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 73 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 74 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 75 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 76 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWANYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 77 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESFGTEWVNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 78 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 79 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 80 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 81 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 82 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWANYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 83 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.24.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVWWESFGTEWVNYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 84 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESFGIEWSSYKTTPPVLDSDGSFFLYSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 85 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 86 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 87 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTVS KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 88 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESFGTEWASYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 89 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.6 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVLWESFGTEWVSYKTTPPVLDSDGSFFLYSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 90 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.N390 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTV TKSEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 91 MPPPRTGRGLLWLGLVLSSVCVALGSETQANSTTDALNVLLIIVDD Full-length human LRPSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQQAVCAPSRVSFLT iduronate sulfatase (IDS) GRRPDTTRLYDFNSYWRVHAGNFSTIPQYFKENGYVTMSVGKVFH polypeptide sequence PGISSNHTDDSPYSWSFPPYHPSSEKYENTKTCRGPDGELHANLLCP VDVLDVPEGTLPDKQSTEQAIQLLEKMKTSASPFFLAVGYHKPHIPF RYPKEFQKLYPLENITLAPDPEVPDGLPPVAYNPWMDIRQREDVQA LNISVPYGPIPVDFQRKIRQSYFASVSYLDTQVGRLLSALDDLQLAN STIIAFTSDHGWALGEHGEWAKYSNFDVATHVPLIFYVPGRTASLPE AGEKLFPYLDPFDSASQLMEPGRQSMDLVELVSLFPTLAGLAGLQV PPRCPVPSFHVELCREGKNLLKHFRFRDLEEDPYLPGNPRELIAYSQ YPRPSDIPQWNSDKPSLKDIKIMGYSIRTIDYRYTVWVGFNPDEFLA NFSDIHAGELYFVDSDPLQDHNMYNDSQGGDLFQLLMP 92 TDALNVLLIIVDDLRPSLGCYGDKLVRSPNIDQLASHSLLFQNAFAQ Mature human iduronate QAVCAPSRVSFLTGRRPDTTRLYDFNSYWRVHAGNFSTIPQYFKEN sulfatase (IDS) GYVTMSVGKVFHPGISSNHTDDSPYSWSFPPYHPSSEKYENTKTCR polypeptide sequence GPDGELHANLLCPVDVLDVPEGTLPDKQSTEQAIQLLEKMKTSASP FFLAVGYHKPHIPFRYPKEFQKLYPLENITLAPDPEVPDGLPPVAYN PWMDIRQREDVQALNISVPYGPIPVDFQRKIRQSYFASVSYLDTQV GRLLSALDDLQLANSTIIAFTSDHGWALGEHGEWAKYSNFDVATH VPLIFYVPGRTASLPEAGEKLFPYLDPFDSASQLMEPGRQSMDLVEL VSLFPTLAGLAGLQVPPRCPVPSFHVELCREGKNLLKHFRFRDLEED PYLPGNPRELIAYSQYPRPSDIPQWNSDKPSLKDIKIMGYSIRTIDYR YTVWVGFNPDEFLANFSDIHAGELYFVDSDPLQDHNMYNDSQGGD LFQLLMP 93 MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCIPKSF Full-length human β- GYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPI glucocerbosidase QANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQ polypeptide sequence NLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPDDFQLHNFSL PEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKG SLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGL LSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRL LLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLF PNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSIITNLLYHVV GWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHLG HFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSK DVPLTIKDPAVGFLETISPGYSIHTYLWRRQ 94 ARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGR Mature human β- RMELSMGPIQANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALN glucocerbosidase ILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSIRTYTYADTPD polypeptide sequence DFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKT NGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVT AENEPSAGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRL LMLDDQRLLLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPAKA TLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRGMQYSHSII TNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYK QPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVV VVLNRSSKDVPLTIKDPAVGFLETISPGYSIHTYLWRRQ 95 EPKSCDKTHTCPPCP Human IgG1 hinge amino acid sequence 96 MMDQARSAFSNLFGGEPLSYTRFSLARQVDGDNSHVEMKLAVDEE Human transferrin ENADNNTKANVTKPKRCSGSICYGTIAVIVFFLIGFMIGYLGYCKGV receptor protein 1 EPKTECERLAGTESPVREEPGEDFPAARRLYWDDLKRKLSEKLDST (TFR1) DFTGTIKLLNENSYVPREAGSQKDENLALYVENQFREFKLSKVWRD QHFVKIQVKDSAQNSVIIVDKNGRLVYLVENPGGYVAYSKAATVT GKLVHANFGTKKDFEDLYTPVNGSIVIVRAGKITFAEKVANAESLN AIGVLIYMDQTKFPIVNAELSFFGHAHLGTGDPYTPGFPSFNHTQFP PSRSSGLPNIPVQTISRAAAEKLFGNMEGDCPSDWKTDSTCRMVTS ESKNVKLTVSNVLKEIKILNIFGVIKGFVEPDHYVVVGAQRDAWGP GAAKSGVGTALLLKLAQMFSDMVLKDGFQPSRSIIFASWSAGDFGS VGATEWLEGYLSSLHLKAFTYINLDKAVLGTSNFKVSASPLLYTLIE KTMQNVKHPVTGQFLYQDSNWASKVEKLTLDNAAFPFLAYSGIPA VSFCFCEDTDYPYLGTTMDTYKELIERIPELNKVARAAAEVAGQFVI KLTHDVELNLDYERYNSQLLSFVRDLNQYRADIKEMGLSLQWLYS ARGDFFRATSRLTTDFGNAEKTDRFVMKKLNDRVMRVEYHFLSPY VSPKESPFRHVFWGSGSHTLPALLENLKLRKQNNGAFNETLFRNQL ALATWTIQGAANALSGDVWDIDNEF 97 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 98 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 99 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.21 with YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC knob and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL VKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 100 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob, LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 101 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Fc sequence with hole WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLVSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 102 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Fc sequence with hole WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY and LALA mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLVSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 103 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Fc sequence with hole YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLVSKLTVDK SRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 104 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Fc sequence with hole, WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLVSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 105 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 106 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 107 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.21 with YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC hole and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA VKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLVSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 108 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole, LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVWWESYGTEWSSYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 109 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Fc sequence with knob WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 110 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Fc sequence with knob WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY and LALA mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 111 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Fc sequence with knob YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDK SRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 112 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Fc sequence with knob, WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 113 DKTHTCPPCP Portion of human IgG1 hinge sequence 114 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS sequence QLASHSLLFQ NAFAQQAVCA PSRVSFLTGR RPDTTRLYDF (cysteine modified to NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH formylglycine double TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP underlined) VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMP 115 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV C A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (cysteine modified to VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY formylglycine double HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN underlined) and hole and PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS LALA mutations YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 116 DKTHTCPPCP APEAAGGPSV FLFPPKPKDT LMISRTPEVT Clone CH3C.35.21 with CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY knob and LALA RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK mutations and portion of GQPREPQVYT LPPSRDELTK NQVSLWCLVK GFYPSDIAVW human IgG1 hinge WESYGTEWSS YKTTPPVLDS DGSFFLYSKL TVTKEEWQQG sequence FVFSCSVMHE ALHNHYTQKS LSLSPGK 117 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV C A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (cysteine modified to VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY formylglycine double HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN underlined) and hole PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS mutations YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 118 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV C A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (cysteine modified to VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY formylglycine double HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN underlined) and knob PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS mutation YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLWCLVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLYSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 119 MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIA Full-length human TPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLH sulfoglucosamine QDVHHFNSFDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAY sulfohydrolase TEENGSVLQVGRNITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCG polypeptide sequence HSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPLDVLVPYFVPNTP AARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDN GIPFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPT ILDWFSIPYPSYAIFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHE VTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQDFYVSPTFQDLLNRT TAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQNLATDPRFAQ LLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL 120 RPRNALLLLADDGGFESGAYNNSAIATPHLDALARRSLLFRNAFTS Mature human VSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFDKVRSLPLLLS sulfoglucosamine QAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLV sulfohydrolase RKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGM polypeptide sequence GRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQG VGLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLV SSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLT GRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHN LNFKMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRA RWELYDRSRDPHETQNLATDPRFAQLLEMLRDQLAKWQWETHDP WVCAPDGVLEEKLSPQCQPLHNEL 121 MPRYGASLRQSCPRSGREQGQDGTAGAPGLLWMGLVLALALALA Full-length human acid LALSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCP sphingomyelinase ICKGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLF polypeptide sequence EDDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKP PPKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGS GLPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWT GDIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHES TPVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFY ALSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAA EDRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTH VDEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSG SSHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTA WHNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQ LSARADSPALCRHLMPDGSLPEAQSLWPRPLFC 122 LSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPIC Mature human acid KGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFE sphingomyelinase DDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKPP polypeptide sequence PKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGSG LPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWTG DIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHEST PVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYA LSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAAE DRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTHV DEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGS SHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAW HNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQLS ARADSPALCRHLMPDGSLPEAQSLWPRPLFC 123 LSDSRVLWAPAEAHPLSPQGHPARLHRIVPRLRDVFGWGNLTCPIC Truncated human acid KGLFTAINLGLKKEPNVARVGSVAIKLCNLLKIAPPAVCQSIVHLFE sphingomyelinase DDMVEVWRRSVLSPSEACGLLLGSTCGHWDIFSSWNISLPTVPKPP polypeptide sequence PKPPSPPAPGAPVSRILFLTDLHWDHDYLEGTDPDCADPLCCRRGSG LPPASRPGAGYWGEYSKCDLPLRTLESLLSGLGPAGPFDMVYWTG DIPAHDVWHQTRQDQLRALTTVTALVRKFLGPVPVYPAVGNHEST PVNSFPPPFIEGNHSSRWLYEAMAKAWEPWLPAEALRTLRIGGFYA LSPYPGLRLISLNMNFCSRENFWLLINSTDPAGQLQWLVGELQAAE DRGDKVHIIGHIPPGHCLKSWSWNYYRIVARYENTLAAQFFGHTHV DEFEVFYDEETLSRPLAVAFLAPSATTYIGLNPGYRVYQIDGNYSGS SHVVLDHETYILNLTQANIPGAIPHWQLLYRARETYGLPNTLPTAW HNLVYRMRGDMQLFQTFWFLYHKGHPPSEPCGTPCRLATLCAQLS ARADSPALCRHLMPDGSLPEAQ 124 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3B.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPRFDYVTTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYGFHDLSLSPGK 125 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3B.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPRFDMVTTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYGFHDLSLSPGK 126 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3B.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPRFEYVTTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYGFHDLSLSPGK 127 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3B.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPRFEMVTTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYGFHDLSLSPGK 128 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3B.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPRFELVTTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYGFHDLSLSPGK 129 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVEFIW Clone CH2A2.1 YVDGVDVRYEWQLPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDK SRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 130 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVGFV Clone CH2A2.2 WYVDGVPVSWEWYWPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 131 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVQFD Clone CH2A2.3 WYVDGVMVRREWHRPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 132 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVSFEW Clone CH2A2.4 YVDGVPVRWEWQWPREEQYNSTYRVVSVLTVLHQDWLNGKEYK CKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVD KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 133 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVAFTW Clone CH2A2.5 YVDGVPVRWEWQNPREEQYNSTYRVVSVLTVLHQDWLNGKEYK CKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVD KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 134 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDPQTPPWEVKFN Clone CH2C.1 WYVDGVEVHNAKTKPREEEYYTYYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 135 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDPPSPPWEVKFNW Clone CH2C.2 YVDGVEVHNAKTKPREEEYYSNYRVVSVLTVLHQDWLNGKEYKC KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDK SRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 136 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDPQTPPWEVKFN Clone CH2C.3 WYVDGVEVHNAKTKPREEEYYSNYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 137 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDFRGPPWEVKFN Clone CH2C.4 WYVDGVEVHNAKTKPREEEYYHDYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 138 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDPQTVPWEVKFN Clone CH2C.5 WYVDGVEVHNAKTKPREEEYYSNYRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 139 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSVPPRMVKFN Clone CH2D.1 WYVDGVEVHNAKTKSLTSQHNSTVRVVSVLTVLHQDWLNGKEYK CKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVD KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 140 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSVPPWMVKFN Clone CH2D.2 WYVDGVEVHNAKTKSLTSQHNSTVRVVSVLTVLHQDWLNGKEYK CKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVD KSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 141 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSDMWEYVKFN Clone CH2D.3 WYVDGVEVHNAKTKPWVKQLNSTWRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 142 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSDDWTWVKFN Clone CH2D.4 WYVDGVEVHNAKTKPWIAQPNSTWRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 143 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSDDWEWVKFN Clone CH2D.5 WYVDGVEVHNAKTKPWKLQLNSTWRVVSVLTVLHQDWLNGKEY KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 144 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPWVWFY Clone CH2E3.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCSVVNIALWWSIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 145 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPVVGFR Clone CH2E3.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCRVSNSALTWKIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 146 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPVVGFR Clone CH2E3.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCRVSNSALSWRIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTV DKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 147 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPIVGFRW Clone CH2E3.4 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC RVSNSALRWRIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDK SRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 148 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPAVGFE Clone CH2E3.5 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY KCQVFNWALDWVIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL TCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 149 RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF SGSH-Fc fusion RNAFTSVSSC SPSRASLLTG LPQHQNGMYG LHQDVHHFNS polypeptide with mature FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE human SGSH sequence ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP (underlined) fused to the HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV N-terminus of an Fc LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA sequence with hole and GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE LALA mutations HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELGGGGSDKT HTCPPCPAPE AAGGPSVFLF PPKPKDTLMI SRTPEVTCVV VDVSHEDPEV KFNWYVDGVE VHNAKTKPRE EQYNSTYRVV SVLTVLHQDW LNGKEYKCKV SNKALPAPIE KTISKAKGQP REPQVYTLPP SRDELTKNQV SLSCAVKGFY PSDIAVEWES NGQPENNYKT TPPVLDSDGS FFLVSKLTVD KSRWQQGNVF SCSVMHEALH NHYTQKSLSL SPGK 150 APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED SGSH-Fc fusion PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH polypeptide with mature QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT human SGSH sequence LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN (underlined) fused to the YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE C-terminus of an Fc ALHNHYTQKS LSLSPGKGGG GSRPRNALLL LADDGGFESG sequence with hole and AYNNSAIATP HLDALARRSL LFRNAFTSVS SCSPSRASLL LALA mutations TGLPQHQNGM YGLHQDVHHF NSFDKVRSLP LLLSQAGVRT GIIGKKHVGP ETVYPFDFAY TEENGSVLQV GRNITRIKLL VRKFLQTQDD RPFFLYVAFH DPHRCGHSQP QYGTFCEKFG NGESGMGRIP DWTPQAYDPL DVLVPYFVPN TPAARADLAA QYTTVGRMDQ GVGLVLQELR DAGVLNDTLV IFTSDNGIPF PSGRTNLYWP GTAEPLLVSS PEHPKRWGQV SEAYVSLLDL TPTILDWFSI PYPSYAIFGS KTIHLTGRSL LPALEAEPLW ATVFGSQSHH EVTMSYPMRS VQHRHFRLVH NLNFKMPFPI DQDFYVSPTF QDLLNRTTAG QPTGWYKDLR HYYYRARWEL YDRSRDPHET QNLATDPRFA QLLEMLRDQL AKWQWETHDP WVCAPDGVLE EKLSPQCQPL HNEL 151 APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED Clone CH3C.35.21.17 PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH with knob and LALA QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT mutations LPPSRDELTK NQVSLWCLVK GFYPSDIAVL WESYGTEWSS YKTTPPVLDS DGSFFLYSKL TVTKEEWQQG FVFSCSVMHE ALHNHYTQKS LSLSPGK 152 RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF SGSH-Fc fusion RNAFTSVSSC SPSRASLLTG LPQHQNGMYG LHQDVHHFNS polypeptide with mature FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE human SGSH sequence ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP (underlined) fused to the HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV N-terminus of an Fc LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA sequence with knob and GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE LALA mutations HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELGGGGSDKT HTCPPCPAPE AAGGPSVFLF PPKPKDTLMI SRTPEVTCVV VDVSHEDPEV KFNWYVDGVE VHNAKTKPRE EQYNSTYRVV SVLTVLHQDW LNGKEYKCKV SNKALPAPIE KTISKAKGQP REPQVYTLPP SRDELTKNQV SLWCLVKGFY PSDIAVEWES NGQPENNYKT TPPVLDSDGS FFLYSKLTVD KSRWQQGNVF SCSVMHEALH NHYTQKSLSL SPGK 153 APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED SGSH-Fc fusion PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH polypeptide with mature QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT human SGSH sequence LPPSRDELTK NQVSLWCLVK GFYPSDIAVE WESNGQPENN (underlined) fused to the YKTTPPVLDS DGSFFLYSKL TVDKSRWQQG NVFSCSVMHE C-terminus of an Fc ALHNHYTQKS LSLSPGKGGG GSRPRNALLL LADDGGFESG sequence with knob and AYNNSAIATP HLDALARRSL LFRNAFTSVS SCSPSRASLL LALA mutations TGLPQHQNGM YGLHQDVHHF NSFDKVRSLP LLLSQAGVRT GIIGKKHVGP ETVYPFDFAY TEENGSVLQV GRNITRIKLL VRKFLQTQDD RPFFLYVAFH DPHRCGHSQP QYGTFCEKFG NGESGMGRIP DWTPQAYDPL DVLVPYFVPN TPAARADLAA QYTTVGRMDQ GVGLVLQELR DAGVLNDTLV IFTSDNGIPF PSGRTNLYWP GTAEPLLVSS PEHPKRWGQV SEAYVSLLDL TPTILDWFSI PYPSYAIFGS KTIHLTGRSL LPALEAEPLW ATVFGSQSHH EVTMSYPMRS VQHRHFRLVH NLNFKMPFPI DQDFYVSPTF QDLLNRTTAG QPTGWYKDLR HYYYRARWEL YDRSRDPHET QNLATDPRFA QLLEMLRDQL AKWQWETHDP WVCAPDGVLE EKLSPQCQPL HNEL 154 RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF SGSH-Fc fusion RNAFTSVSSC SPSRASLLTG LPQHQNGMYG LHQDVHHFNS polypeptide with mature FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE human SGSH sequence ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP (underlined) fused to the HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV N-terminus of clone LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA CH3C.35.21.17 with GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE knob and LALA HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT mutations IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELGGGGSDKT HTCPPCPAPE AAGGPSVFLF PPKPKDTLMI SRTPEVTCVV VDVSHEDPEV KFNWYVDGVE VHNAKTKPRE EQYNSTYRVV SVLTVLHQDW LNGKEYKCKV SNKALPAPIE KTISKAKGQP REPQVYTLPP SRDELTKNQV SLWCLVKGFY PSDIAVLWES YGTEWSSYKT TPPVLDSDGS FFLYSKLTVT KEEWQQGFVF SCSVMHEALH NHYTQKSLSL SPGK 155 APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED SGSH-Fc fusion PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH polypeptide with mature QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT human SGSH sequence LPPSRDELTK NQVSLWCLVK GFYPSDIAVL WESYGTEWSS (underlined) fused to the YKTTPPVLDS DGSFFLYSKL TVTKEEWQQG FVFSCSVMHE C-terminus of clone ALHNHYTQKS LSLSPGKGGG GSRPRNALLL LADDGGFESG CH3C.35.21.17 with AYNNSAIATP HLDALARRSL LFRNAFTSVS SCSPSRASLL knob and LALA TGLPQHQNGM YGLHQDVHHF NSFDKVRSLP LLLSQAGVRT mutations GIIGKKHVGP ETVYPFDFAY TEENGSVLQV GRNITRIKLL VRKFLQTQDD RPFFLYVAFH DPHRCGHSQP QYGTFCEKFG NGESGMGRIP DWTPQAYDPL DVLVPYFVPN TPAARADLAA QYTTVGRMDQ GVGLVLQELR DAGVLNDTLV IFTSDNGIPF PSGRTNLYWP GTAEPLLVSS PEHPKRWGQV SEAYVSLLDL TPTILDWFSI PYPSYAIFGS KTIHLTGRSL LPALEAEPLW ATVFGSQSHH EVTMSYPMRS VQHRHFRLVH NLNFKMPFPI DQDFYVSPTF QDLLNRTTAG QPTGWYKDLR HYYYRARWEL YDRSRDPHET QNLATDPRFA QLLEMLRDQL AKWQWETHDP WVCAPDGVLE EKLSPQCQPL HNEL 156 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 157 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 158 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 159 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.20.1 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with knob and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL mutations VKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLTVTKE EWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 160 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 161 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALAPG, KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL and YTE mutations WCLVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 162 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 163 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 164 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 165 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.20.1 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with hole and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA mutations VKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVTKE EWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 166 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 167 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.20.1 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESFGTEWSSYKTTPPVLDSDGSFFLVSKLTVT KEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 168 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 169 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 170 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 171 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.2 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with knob and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL mutations VKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 172 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 173 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALAPG, KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL and YTE mutations WCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 174 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 175 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 176 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 177 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.2 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with hole and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA mutations VKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 178 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 179 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 180 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 181 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 182 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 183 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.3 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with knob and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL mutations VKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 184 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 185 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALAPG, KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL and YTE mutations WCLVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 186 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 187 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 188 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 189 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.3 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with hole and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA mutations VKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 190 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 191 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.3 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWVNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 192 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VSKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 193 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VSKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 194 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VSKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 195 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.4 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with knob and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL mutations VKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTVSK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 196 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VSKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 197 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALAPG, KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL and YTE mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VSKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 198 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 199 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 200 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 201 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23.4 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with hole and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA mutations VKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTVSK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 202 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 203 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23.4 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV SKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 204 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 205 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 206 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 207 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.21.17.2 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with knob and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL mutations VKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 208 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 209 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with knob, LALAPG, KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL and YTE mutations WCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 210 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 211 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 212 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 213 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.21.17.2 YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC with hole and YTE KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA mutations VKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 214 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALA, and KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 215 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.21.17.2 WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY with hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 216 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob mutation KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 217 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 218 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 219 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23 with YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC knob and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLWCL VKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 220 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob, LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 221 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY knob, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSL YTE mutations WCLVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLYSKLT VTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 222 APELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole mutations KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 223 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole and LALA KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 224 APEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole and LALAPG KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 225 APELLGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFNW Clone CH3C.35.23 with YVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEYKC hole and YTE mutations KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLSCA VKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTVTK EEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 226 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole, LALA, and YTE KCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 227 APEAAGGPSVFLFPPKPKDTLYITREPEVTCVVVDVSHEDPEVKFN Clone CH3C.35.23 with WYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDWLNGKEY hole, LALAPG, and KCKVSNKALGAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLS YTE mutations CAVKGFYPSDIAVEWESYGTEWSNYKTTPPVLDSDGSFFLVSKLTV TKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK 228 DKTHTCPPCPAPEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVS Clone CH3C.35.21.17.2 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQ with knob and LALA DWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDE mutations and portion of LTKNQVSLWCLVKGFYPSDIAVLWESYGTEWASYKTTPPVLDSDG human IgG1 hinge SFFLYSKLTVTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK sequence 229 DKTHTCPPCPAPEAAGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVS Clone CH3C.35.23.2 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQ with knob and LALA DWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDE mutations and portion of LTKNQVSLWCLVKGFYPSDIAVEWESYGTEWANYKTTPPVLDSDG human IgG1 hinge SFFLYSKLTVTKEEWQQGFVFSCSVMHEALHNHYTQKSLSLSPGK sequence 230 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS sequence QLASHSLLFQ NAFAQQAVCA PSRVSFLTGR RPDTTRLYDF NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMP 231 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAVCA PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined and TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP hole and LALA VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY mutations HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 232 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAVCA PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined and TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP hole mutations VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 233 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAVCA PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined and TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP knob mutation VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLWCLVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLYSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 234 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS sequence QLASHSLLFQ NAFAQQAVfGA PSRVSFLTGR RPDTTRLYDF (formylglycine residue NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH ″fG″ double underlined) TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMP 235 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV fG A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (formylglycine residue VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY ″fG″ double underlined) HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN and hole and LALA PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS mutations YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APEAAGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 236 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV fG A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (formylglycine residue VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY ″fG″ double underlined) HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN and hole mutations PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLSCAVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLVSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 237 SETQANSTTD ALNVLLIIVD DLRPSLGCYG DKLVRSPNID IDS-Fc fusion QLASHSLLFQ NAFAQQAV fG A PSRVSFLTGR RPDTTRLYDF polypeptide with IDS NSYWRVHAGN FSTIPQYFKE NGYVTMSVGK VFHPGISSNH sequence underlined TDDSPYSWSF PPYHPSSEKY ENTKTCRGPD GELHANLLCP (formylglycine residue VDVLDVPEGT LPDKQSTEQA IQLLEKMKTS ASPFFLAVGY ″fG″ double underlined) HKPHIPFRYP KEFQKLYPLE NITLAPDPEV PDGLPPVAYN and knob mutation PWMDIRQRED VQALNISVPY GPIPVDFQRK IRQSYFASVS YLDTQVGRLL SALDDLQLAN STIIAFTSDH GWALGEHGEW AKYSNFDVAT HVPLIFYVPG RTASLPEAGE KLFPYLDPFD SASQLMEPGR QSMDLVELVS LFPTLAGLAG LQVPPRCPVP SFHVELCREG KNLLKHFRFR DLEEDPYLPG NPRELIAYSQ YPRPSDIPQW NSDKPSLKDI KIMGYSIRTI DYRYTVWVGF NPDEFLANFS DIHAGELYFV DSDPLQDHNM YNDSQGGDLF QLLMPGGGGS DKTHTCPPCP APELLGGPSV FLFPPKPKDT LMISRTPEVT CVVVDVSHED PEVKFNWYVD GVEVHNAKTK PREEQYNSTY RVVSVLTVLH QDWLNGKEYK CKVSNKALPA PIEKTISKAK GQPREPQVYT LPPSRDELTK NQVSLWCLVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLYSKL TVDKSRWQQG NVFSCSVMHE ALHNHYTQKS LSLSPGK 238 NSVIIVDKNGRLVYLVENPGGYVAYSKAATVTGKLVHANFGTKKD Human TfR apical FEDLYTPVNGSIVIVRAGKITFAEKVANAESLNAIGVLIYMDQTKFPI domain VNAELSFFGHAHLGTGDPYTPGFPSFNHTQFPPSRSSGLPNIPVQTIS RAAAEKLFGNMEGDCPSDWKTDSTCRMVTSESKNVKLTVS 239 GGGGS Glycine-rich linker 240 GGGGSGGGGS Glycine-rich linker 241 HHHHHH Hexahistidine tag 242 YxTEWSS Library motif 

What is claimed is:
 1. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the IDS amino acid sequence comprises a sequence having at least 90% identity to SEQ ID NO: 92, and wherein the IDS amino acid sequence is linked to the N-terminus of the first Fc polypeptide; and b. a second Fc polypeptide comprising a sequence having at least 90% identity to SEQ ID NO: 151 that is capable of specifically binding to a transferrin receptor.
 2. The protein of claim 1, wherein the protein is capable of being transported across the blood-brain barrier of a primate.
 3. The protein of claim 1, wherein the sequence having at least 90% identity to SEQ ID NO: 92 is selected from the group consisting of SEQ ID NOs: 92, 230, and
 234. 4. The protein of claim 1, wherein the fusion polypeptide comprises the amino acid sequence of SEQ ID NO: 115 or
 235. 5. The protein of claim 1, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: a. Trp, Leu, or Glu at position 150; b. Tyr at position 154; c. Thr at position 156; d. Glu at position 157; e. Trp at position 158; f. Ser or Ala at position 159; g. Ser or Asn at position 160; h. Thr at position 183; i. Glu at position 185; j. Glu at position 186; and k. Phe at position
 191. 6. The protein of claim 5, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: a. Leu at position 150; b. Ala at position 159; and c. Ser at position
 160. 7. The protein of claim 5, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: a. Glu at position 150; b. Ala at position 159; and c. Asn at position
 160. 8. The protein of claim 5, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: a. Trp at position 150; b. Ser at position 159; and c. Ser at position
 160. 9. The protein of claim 5, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: a. Leu at position 150; b. Ser at position 159; and c. Ser at position
 160. 10. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 98. 11. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 116. 12. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 151. 13. The protein of claim 12, wherein the second Fc polypeptide further comprises, at the N-terminus of SEQ ID NO: 151, the amino acid sequence of SEQ ID NO:
 113. 14. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 205. 15. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 228. 16. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 169. 17. The protein of claim 1, wherein the fusion polypeptide comprises the sequence of SEQ ID NO: 115 or 235, and the sequence having at least 90% identity to SEQ ID NO: 151 is SEQ ID NO:
 229. 18. A pharmaceutical composition comprising the protein of claim 1 and a pharmaceutically acceptable excipient.
 19. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide comprises SEQ ID NO: 235; and b. a second Fc polypeptide comprising a sequence having at least 90% identity to SEQ ID NO: 151 and having at the following positions with reference to SEQ ID NO: 151: i. Trp, Leu, or Glu at position 150; ii. Tyr at position 154; iii. Thr at position 156; iv. Glu at position 157; v. Trp at position 158; vi. Ser or Ala at position 159; vii. Ser or Asn at position 160; viii. Thr at position 183; ix. Glu at position 185; x. Glu at position 186; and xi. Phe at position
 191. 20. The protein of claim 19, wherein the second Fc polypeptide is SEQ ID NO:
 116. 21. The protein of claim 19, wherein the second Fc polypeptide is the combination of SEQ ID NO: 151 and SEQ ID NO: 113, wherein SEQ ID NO: 113 is at the N-terminus of SEQ ID NO:
 151. 22. The protein of claim 19, wherein the second Fc polypeptide is SEQ ID NO:
 228. 23. The protein of claim 19, wherein the second Fc polypeptide is SEQ ID NO:
 229. 24. A pharmaceutical composition comprising the protein of claim 19 and a pharmaceutically acceptable excipient.
 25. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the IDS amino acid sequence comprises a sequence having at least 90% identity to SEQ ID NO: 92, and wherein the IDS amino acid sequence is linked to the N-terminus of the first Fc polypeptide; and b. a second Fc polypeptide comprising a sequence having at least 90% identity to SEQ ID NO: 151 and having the following eight residues at respective positions with reference to SEQ ID NO: 151: i. Tyr at position 154; ii. Thr at position 156; iii. Glu at position 157; iv. Trp at position 158; v. Thr at position 183; vi. Glu at position 185; vii. Glu at position 186; and viii. Phe at position
 191. 26. The protein of claim 25, wherein the sequence having at least 90% identity to SEQ ID NO: 151 has at the following positions with reference to SEQ ID NO: 151: ix. Trp, Leu, or Glu at position 150; x. Ser or Ala at position 159; and xi. Ser or Asn at position
 160. 27. A pharmaceutical composition comprising the protein of claim 25 and a pharmaceutically acceptable excipient.
 28. The protein of claim 25, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235, and the second Fc polypeptide is at least 95% identical to SEQ ID NO:
 98. 29. The protein of claim 25, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235, and the second Fc polypeptide is at least 95% identical to SEQ ID NO:
 151. 30. The protein of claim 25, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235, and the second Fc polypeptide is at least 95% identical to SEQ ID NO:
 205. 31. The protein of claim 25, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235, and the second Fc polypeptide is at least 95% identical to SEQ ID NO:
 169. 32. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235; and b. a second Fc polypeptide that is at least 95% identical to SEQ ID NO:
 116. 33. A pharmaceutical composition comprising the protein of claim 32 and a pharmaceutically acceptable excipient.
 34. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235; and b. a second Fc polypeptide that is at least 95% identical to the combination of SEQ ID NO: 151 and SEQ ID NO: 113, wherein SEQ ID NO: 113 is at the N-terminus SEQ ID NO:
 151. 35. A pharmaceutical composition comprising the protein of claim 34 and a pharmaceutically acceptable excipient.
 36. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235; and b. a second Fc polypeptide that is at least 95% identical to SEQ ID NO:
 228. 37. A pharmaceutical composition comprising the protein of claim 36 and a pharmaceutically acceptable excipient.
 38. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is at least 95% identical to SEQ ID NO: 235; and b. a second Fc polypeptide that is at least 95% identical to SEQ ID NO:
 229. 39. A pharmaceutical composition comprising the protein of claim 38 and a pharmaceutically acceptable excipient.
 40. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is SEQ ID NO: 235; and b. a second Fc polypeptide that is SEQ ID NO:
 116. 41. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is SEQ ID NO: 235; and b. a second Fc polypeptide that is the combination of SEQ ID NO: 113 and SEQ ID NO: 151, wherein SEQ ID NO: 113 is at the N-terminus of SEQ ID NO:
 151. 42. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is SEQ ID NO: 235; and b. a second Fc polypeptide that is SEQ ID NO:
 228. 43. A protein comprising: a. a fusion polypeptide comprising a first Fc polypeptide linked to an iduronate 2-sulfatase (IDS) amino acid sequence, wherein the fusion polypeptide is SEQ ID NO: 235; and b. a second Fc polypeptide that is SEQ ID NO:
 229. 44. The protein of claim 1, wherein the IDS amino acid sequence is linked to the first Fc polypeptide by a linker.
 45. The protein of claim 1, wherein the IDS amino acid sequence is directly joined to the first Fc polypeptide.
 46. The protein of claim 25, wherein the IDS amino acid sequence is linked to the first Fc polypeptide by a linker.
 47. The protein of claim 25, wherein the IDS amino acid sequence is directly joined to the first Fc polypeptide. 